Identities = 78/96 (81%), Positives = 87/96 (90%)

Query: 2 QSIDLLVTOAGLMiGLDKSYEADFGDWMTMIN^^ 61 
Q I +LVNNAGLALGLDK+YEADF +WMTMINTN+VGLIYLTR +LP MV + G+IINL 
Sbjct: 82 QDITIL^fflNRG^ALGLDKAYEaDFEtIWm , MImSIIVGI.I■^LTRQLLPH^IVSKDDGIIINL 141

Query: 62 GSXAGTIPYPGANWGASKAWKQFSLNLRMLAGT 97 

GS AGTIPYPGAN+YGASKAFVKQFSLNI1RADLAG+ 
Sbjct: 142 GSTAGTIPYPGANIYGASKAFVKQFSLiruRADLAGS 177 


Based on this analysis, it was predicted that these proteins and their epitopes could be useful antigens for 
vaccines or diagnostics. 

Example 2339 

A DNA sequence (GBSx2492) was identified in S.agalactiae <SEQ ID 7171> which encodes the amino
acid sequence <SEQ ID 7172>. This protein is predicted to be mercuric reductase. Analysis of this protein
sequence reveals the following:

Possible site: 53
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2115 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
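
Localisation results of this kind follow a fixed line pattern (compartment, `Certainty=` score, verdict in parentheses), so they can be tabulated mechanically. The following is a minimal Python sketch and not part of the original analysis: the helper name `parse_final_results` and the regular expression are assumptions made for illustration, and the sample text is the Example 2339 result block.

```python
import re

# Hypothetical helper (not from the source): parse "Final Results" lines of the
# form "bacterial cytoplasm --- Certainty=0.2115 (Affirmative) < succ>".
LINE_RE = re.compile(
    r"bacterial\s+(?P<site>cytoplasm|membrane|outside)"  # compartment label
    r"\W*Certainty\s*=\s*(?P<score>[0-9.\s]+?)\s*"       # score, tolerating stray spaces
    r"\((?P<verdict>[^)]*)\)"                            # verdict in parentheses
)


def parse_final_results(text):
    """Return {compartment: (certainty, verdict)} for each result line found."""
    results = {}
    for match in LINE_RE.finditer(text):
        score = float(match.group("score").replace(" ", ""))
        results[match.group("site")] = (score, match.group("verdict").strip())
    return results


example_2339 = """
bacterial cytoplasm --- Certainty=0.2115 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
"""

parsed = parse_final_results(example_2339)
best_site = max(parsed, key=lambda site: parsed[site][0])
print(best_site, parsed[best_site])
```

The score pattern deliberately tolerates spaces inside the number, so a line extracted as `Certainty=0 . 0000` still parses.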

The protein has homology with the following sequences in the GENPEPT database.

>GP:CAC14663 GB:Y10855 mercuric reductase [Bacillus licheniformis]
Identities = 68/104 (65%), Positives = 82/104 (78%)

Query: 1 MNKFKVNISGMTCTGCEKHVESALEKIGAKNIESSYRRGEAVFELPDDIEVESAIKAIDE 60
M K++VN+ GMTCTGCE+HV ALE +GAK IE YRRGEAVFELP+ +EVE+A KAI E
Sbjct: 1 MKKYRVNVQGMTCTGCEEHVAVALENMGAKRIEVDYRRGEAVFELPNGLEVETAKKAIAE 60

Query: 61 ANYQAGEIEEVSSLENVALINEDNYDLLIIGSGAAAFSSAIKAI 104
A YQ GE EEV S E + L +E +YD +IIGSG AAFSSAI+A+
Sbjct: 61 AKYQPGEAEEVQSQELIQLGDEGDYDYIIIGSGGAAFSSAIEAV 104
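
The identity count reported for an alignment can be re-derived from the two aligned rows by counting the columns in which both rows carry the same residue. The sketch below is an illustration only, not the method used to produce the figures in this document: `count_identities` is a hypothetical helper, the two strings are the Query and Sbjct rows of the alignment above joined end to end, and the truncating percentage is an assumption about how the report rounds.

```python
# Aligned rows copied from the Example 2339 alignment (this alignment has no gaps).
QUERY = (
    "MNKFKVNISGMTCTGCEKHVESALEKIGAKNIESSYRRGEAVFELPDDIEVESAIKAIDE"
    "ANYQAGEIEEVSSLENVALINEDNYDLLIIGSGAAAFSSAIKAI"
)
SBJCT = (
    "MKKYRVNVQGMTCTGCEEHVAVALENMGAKRIEVDYRRGEAVFELPNGLEVETAKKAIAE"
    "AKYQPGEAEEVQSQELIQLGDEGDYDYIIIGSGGAAFSSAIEAV"
)


def count_identities(query, sbjct):
    """Count alignment columns where both rows hold the same residue (gaps never match)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")


identities = count_identities(QUERY, SBJCT)
length = len(QUERY)
# Integer division truncates; this is an assumption about how the report rounds.
print(f"Identities = {identities}/{length} ({identities * 100 // length}%)")
```

Counting positives in the same way would additionally need the substitution matrix used for the search, which is not stated here.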

No corresponding DNA sequence was identified in S.pyogenes. 

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2340

A DNA sequence (GBSx2494) was identified in S.agalactiae <SEQ ID 7173> which encodes the amino
acid sequence <SEQ ID 7174>. Analysis of this protein sequence reveals the following:

Possible site: 58
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.3341 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

The protein has no significant homology with any sequences in the GENPEPT database.
No corresponding DNA sequence was identified in S.pyogenes.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2341 

A DNA sequence (GBSx2495) was identified in S.agalactiae <SEQ ID 7175> which encodes the amino 
acid sequence <SEQ ID 7176>. Analysis of this protein sequence reveals the following: 

Possible site: 31
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.4989 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

The protein has no significant homology with any sequences in the GENPEPT database.
No corresponding DNA sequence was identified in S.pyogenes.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2342 

A DNA sequence (GBSx2496) was identified in S.agalactiae <SEQ ID 7177> which encodes the amino 
acid sequence <SEQ ID 7178>. Analysis of this protein sequence reveals the following: 

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2569 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2343 

A DNA sequence (GBSx2497) was identified in S.agalactiae <SEQ ID 7179> which encodes the amino 
acid sequence <SEQ ID 7180>. This protein is predicted to be DNA polymerase III alpha subunit (dnaE). 
Analysis of this protein sequence reveals the following: 

Possible site: 60
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.3124 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

A related DNA sequence was identified in S.pyogenes <SEQ ID 4095> which encodes the amino acid
sequence <SEQ ID 4096>. Analysis of this protein sequence reveals the following:

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2600 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

An alignment of the GAS and GBS proteins is shown below. 

Identities = 186/237 (78%), Positives = 214/237 (89%)

Query: 10 DPVKHNLIFERFLNEERYSMPDIDIDLPDIYRGEFLRYVRNRYGSMHSAQIVTFSTFGAK 69
DPV+H+L+FERFLN+ERYSMPDIDIDLPDIYR EFLRYVRNRYGS HSAQIVTFSTFG K
Sbjct: 321 DPVQHDLLFERFLNKERYSMPDIDIDLPDIYRSEFLRYVRNRYGSDHSAQIVTFSTFGPK 380

Query: 70 QAIRDVFKRFGASEYELTNITKKIHFRDNLTSVYNRNLAFRQIIDSKIEYQKAYDIAKRI 129
QAIRDVFKRFG EYELTN+TKKI F+D+L +VY ++++FRQ+I+S+ E+QKA+ IAKRI
Sbjct: 381 QAIRDVFKRFGVPEYELTNLTKKIGFKDSLATVYEKSISFRQVINSRTEFQKAFAIAKRI 440

Query: 130 EGNPRQTSIHAAGVVMSDDLLTDHIPLKNGEDMMITQYDASSVEDNGLLKMDFLGLRNLT 189
EGNPRQTSIHAAG+VMSDD LT+HIPLK+G+DMMITQYDA +VE NGLLKMDFLGLRNLT
Sbjct: 441 EGNPRQTSIHAAGIVMSDDALTOHIPLKSGDDMMITQYDAHAVEANGLLKMDFLGLRNLT 500

Query: 190 FVQKMKEKTOKDYGISIQLETIDLEDKETLKLFAAGQTKGIFQFEQSGAINLLRRIR 246
FVQKM+EKV KDYG I + IDLED +TL LFA G TKGIFQFEQ+GAINLL+RI+
Sbjct: 501 FVQKMQEKVAKDYGCQIDITAIDLEDPQTLALFAKGDTKGIFQFEQNGAINLLKRIK 557

Based on this analysis, it was predicted that these proteins and their epitopes could be useful antigens for
vaccines or diagnostics.

Example 2344 

A DNA sequence (GBSx2498) was identified in S.agalactiae <SEQ ID 7181> which encodes the amino 
acid sequence <SEQ ID 7182>. This protein is predicted to be a methylase. Analysis of this protein 
sequence reveals the following: 

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2121 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

The protein has homology with the following sequences in the GENPEPT database. 

>GP:AAG21729 GB:AF116907 putative methylase [Corynebacterium hoagii] 
Identities = 48/160 (30%), Positives = 85/160 (53%), Gaps = 6/160 (3%) 

Query: 97 EPDDSENGHNDTDLEETDNQIPEEEVVETIPEIPVTDFYFPEDLTDFYPKTARDKVETNI 156 

EP+ + E + + ++E +P TDF D+ P A+ +V NI 

Sbjct: 1235 EPEAPTQPEAASAAETAEPAVEQQEPRAGPQSVPATDFALGTDV--HVPSGAKARVRANI 1293 

Query: 157 VAIRLVKNLEVEHRNASPSEQELluAKYVGWGGLiANEFFDD YNPKFSKEREELKSLVT 213 

A RLV L+ + R A+ EQ +LA++ GWG + E FD+ + +++ ER L L+ 
Sbjct: 1294 AAARLVIjELDEQQRPATAEEQAVLACjWSGWGAVP-EVFDNRSKFLSEWADERAALLDLLG 1352 

Query: 214 DKEYSDMKQSSLTAYYTDPSLIRQMWGIVERDGFTGWQIL 253 

+K +S ++++L A+YTDP+++ ++W V+R G +L 
Sbjct: 1353 EKGFSQARETTLNAHYTDPAIVGELWRAVQRAGLPDGALL 1392 
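
The summary line of an alignment carries each statistic twice, as a fraction and as a percentage, so the two can be cross-checked, which is useful when the text has been through character recognition. The sketch below is a hypothetical checker, not part of the original analysis. It assumes the percentages are truncated rather than rounded, which is what the figures of this example suggest (6/160 is 3.75% but is reported as 3%).

```python
import re

# Hypothetical checker (not from the source): re-derive each percentage in a
# BLAST-style summary line from its fraction.
FIELD_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def check_summary(line):
    """Return {field: (numerator, denominator, reported_pct, truncated_pct)}."""
    checked = {}
    for name, num, den, pct in FIELD_RE.findall(line):
        num, den, pct = int(num), int(den), int(pct)
        checked[name] = (num, den, pct, num * 100 // den)
    return checked


summary = "Identities = 48/160 (30%), Positives = 85/160 (53%), Gaps = 6/160 (3%)"
for name, (num, den, reported, truncated) in check_summary(summary).items():
    status = "ok" if reported == truncated else "mismatch"
    print(f"{name}: {num}/{den} -> {truncated}% (reported {reported}%) {status}")
```

A reported value that differs from the recomputed one does not say which of the two numbers is wrong; it only marks the line as worth checking against the source.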

No corresponding DNA sequence was identified in S.pyogenes. 

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2345

A DNA sequence (GBSx2499) was identified in S.agalactiae <SEQ ID 7183> which encodes the amino 
acid sequence <SEQ ID 7184>. Analysis of this protein sequence reveals the following: 

Possible site: 34
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.1111 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

The protein has no significant homology with any sequences in the GENPEPT database.
No corresponding DNA sequence was identified in S.pyogenes.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2346 

A DNA sequence (GBSx2501) was identified in S.agalactiae <SEQ ID 7185> which encodes the amino 
acid sequence <SEQ ID 7186>. Analysis of this protein sequence reveals the following: 

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.4752 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

The protein has homology with the following sequences in the GENPEPT database.

>GP:CAA61516 GB:X89232 DNA-directed RNA polymerase [Pediococcus
acidilactici]
Identities = 48/53 (90%), Positives = 52/53 (97%)

Query: 5 KKPETINYRTLKPEREGLFDEVIFGPTKDWECACGKYKRIRYKGIICDRCGVE 57
KKPETINYRTLKPE++GLFDE IFGPTKD+ECACGKYKRIRYKGI+CDRCGVE
Sbjct: 29 KKPETINYRTLKPEKDGLFDERIFGPTKDYECACGKYKRIRYKGIVCDRCGVE 81
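
Each alignment row states its start and end coordinates, so the number of residues printed on the row should equal end minus start plus one. The sketch below applies that check to the two rows of this alignment; `check_row` and its regular expression are hypothetical names introduced for illustration. Stray spaces inside a sequence are ignored and gap characters are not counted as residues.

```python
import re

# Hypothetical helper (not from the source): compare the residue count of an
# alignment row with the span implied by its start and end coordinates.
ROW_RE = re.compile(r"^(Query|Sbjct):\s*(\d+)\s+(.+?)\s+(\d+)\s*$")


def check_row(row):
    """Return (label, residues expected from the coordinates, residues found)."""
    match = ROW_RE.match(row.strip())
    if match is None:
        raise ValueError("not a coordinate-bearing alignment row")
    label, start, seq, end = match.group(1), int(match.group(2)), match.group(3), int(match.group(4))
    found = sum(1 for ch in seq if ch.isalpha())  # skips spaces and '-' gap characters
    return label, end - start + 1, found


query_row = "Query: 5 KKPET INYRTLKPEREGLFDEVI FGPTKDWECACGKYKRI RYKGI I CDRCG VE 57"
sbjct_row = "Sbjct: 29 KKPETINYRTLKPEKDGLFDERIFGPTKDYECACGKYKRIRYKGIVCDRCGVE 81"

for row in (query_row, sbjct_row):
    label, expected, found = check_row(row)
    print(label, expected, found, "ok" if expected == found else "check against source")
```

A row whose counts disagree has usually lost or gained characters in extraction, although the check cannot say which characters.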

There is also homology to SEQ ID 384. 

Based on this analysis, it was predicted that these proteins and their epitopes could be useful antigens for 
vaccines or diagnostics. 

Example 2347 

A DNA sequence (GBSx2502) was identified in S.agalactiae <SEQ ID 7187> which encodes the amino 
acid sequence <SEQ ID 7188>. Analysis of this protein sequence reveals the following: 

Possible site: 22
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.3080 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

The protein has homology with the following sequences in the GENPEPT database.

>GP:AAC00282 GB:AF008220 YtlR [Bacillus subtilis]
Identities = 61/216 (28%), Positives = 98/216 (45%), Gaps = 28/216 (12%)

Query: 8 IPCTYYPVGSGNDFARALKIPNL KETLTAIQTERLKEINCFIYDKGLIL- - 56 

I ++ P G+ NDF+R I + K LT +T L +N F+ DK IL 

Sbjct: 86 IELSFVPAGAYNDFSRGFSIKKIDLIQEIKKVKRPLT--RTFHLGSVN-FLQDKSQILYF 142 

Query: 57 -NSLDLGFAAYWWKASNSKIICNILNRYRLGKITYIVIAIKSLLHSSK -VQVLVE 109 

N + +GF AYV KA ++ + RL + V + S LH+S + E 

Sbjct: 143 MNHIGIGFDAYWKKAMEFPLRRVFLFLRLRFLVYPL SHLHASATFKPFTLACTTE 198 

Query: 110 GETGQQIKLNDLYFFALANWTYFGGGITIWPKASALTAELDMVYAKGHTFLKRLSILLSL 169 

ET + +D++F ++N+ ++GGG+ P A+ D+V + FLK+ +L + 

Sbjct: 199 DETF^---FHDWFAWSNHPFYGGGMKAAPLANPREKTFDIVIVENQPFIjKKYWLLCLM 255 

Query: 170 VFKRHTTSKSIKHQTFKAMTVYFPKNSLIEIDGE^V 205 

F +HT + K +T Y DGEI+ 

Sbjct: 256 AFGKHTKMDGVTMFKAKDITFYTKDKIPFHADGEIM 291 

No corresponding DNA sequence was identified in S.pyogenes. 

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.


Example 2348 

A DNA sequence (GBSx2503) was identified in S.agalactiae <SEQ ID 7189> which encodes the amino
acid sequence <SEQ ID 7190>. This protein is predicted to be protease subunit HflC (hflC). Analysis of this
protein sequence reveals the following:

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.1809 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

The protein has homology with the following sequences in the GENPEPT database.



Query: 1 MSQJTEFAVLLQFGKWQTDTOPGLHV1WPY\^QVRKFDGRLLTLDAPTQRFLTLEKKAVM 50 

■i- QTERAV+L+FG+W++DVKPGLH K+PYVNQVRKFD RLLTLDAPTQRFLTLEKKAVM 
Sbjct: 26 VQQ/TEPJWMLRFGRWESDVKPGLHFKIPY\^IQVRKFDARLLTLDAPTQRFLTLEKKAVM 85 

Query: 61 VDAYAKWRVKDAERFYTATSGDKQIADERLSRRLESGLRDQFGKRTLHEWSGERDALMA 120 
VDAYAKWRV DAERFYTATSGLKQIADERLSRRLE+GLRDQFGKRTLHEWSGERDALM 

Sbjct: 86 

Query: 121 DITGSLNRMAEKELGIEVLDWVKAIDLPK 

DIT SIJSIR^IA+KELGIEV+DVRVKAIDLPKEVNRSVFERMSTEREREAREHRAKG EL E 
Sbjct: 145 DITASIMW1AQKELGIEVIDVRVKAIDLPKEVNRSVFERMSTEREREAREHRAKGRELAE 205 

Query: 181 GIRADADRQRRVLLAEAYRESE 202 

GIRADADRQRRV++AEAYRESE 
Sbjct: 206 GIRADADRQRRVIVAEAYRESE 227 

No corresponding DNA sequence was identified in S.pyogenes. 

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2349

A DNA sequence (GBSx2504) was identified in S.agalactiae <SEQ ID 7191> which encodes the amino
acid sequence <SEQ ID 7192>. Analysis of this protein sequence reveals the following:

Possible site: 24
>>> Seems to have an uncleavable N-term signal seq

Final Results
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

The protein has no significant homology with any sequences in the GENPEPT database.
No corresponding DNA sequence was identified in S.pyogenes.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2350 

A DNA sequence (GBSx2505) was identified in S.agalactiae <SEQ ID 7193> which encodes the amino
acid sequence <SEQ ID 7194>. This protein is predicted to be ABC transporter (ATP-binding;
daunorubicin resistance). Analysis of this protein sequence reveals the following:

Possible site: 56
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.1846 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

The protein has homology with the following sequences in the GENPEPT database.

>GP:CAB15892 GB:Z99123 similar to ABC transporter (ATP-binding
protein) [Bacillus subtilis]
Identities = 88/231 (38%), Positives = 132/231 (57%), Gaps = 13/231 (5%)





Query: 10 QVIGYLPDVPKFYDWAQEYLQLC---AGLAQNKTSLPIADLLEQVGLADN-QQRISTY 65
++IGYLP P FY +MTA E+L +GL++ K I ++LE VGL + +RI Y
Sbjct: 69 RLIGYLPQYPAFYSWMTANEFLTFAGRLSGLSKRKCQEKIGEMLEFVGLHEAAHKRIGGY 128

Query: 66 SRGMKQRLGLAQALIHXXKILICDEPTSALDPQGRQEILSIISQLRGQKTVIFSTHILSD 125
S GMKQRLGLAQAL+H K LI DEP SALDP GR E+L ++ +L+ V+FSTH+L D
Sbjct: 129 SGGMKQRLGIAQALLHKPKFLILDEPVSALDPTGRFEVLD^ELKKHMAVLFSTHVLHD 188

Query: 126 VEKVCDQVLILTKSGIH---NLEDLRDKaSASVNQJ^NLBIKVSDNEAQKLALRFPLNQKD 182
E+VCDQV+I+ I L++L+ + +V L++ K+ +K + + +
Sbjct: 189 AEQVCDQWIMKNGEISWKGELQELKQQQQTNVFTLSVKEKLEGWLEEKPYVSAIVYKNP 248

Query: 183 QYYKVHLELSEANNREQALASFYRYLVEQEITPYFIELLEDSLEDFYLEVI 233
+ EL + + L+ + + +T E +SLED YL+V+
Sbjct: 249 S--QAVFELPDIHAGRSLLSD CIRKGLTVTRFEQKTESLEDVYLKW 293





There is also homology to SEQ ID 686. 

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2351

A DNA sequence (GBSx2506) was identified in S.agalactiae <SEQ ID 7195> which encodes the amino
acid sequence <SEQ ID 7196>. Analysis of this protein sequence reveals the following:

Possible site: 52
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.0679 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

The protein has homology with glycine-rich cell wall proteins (e.g. GB:AL161589 - the glycine-rich cell wall
protein from Arabidopsis thaliana) and to SEQ ID 6882.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2352 

A DNA sequence (GBSx2507) was identified in S.agalactiae <SEQ ID 7197> which encodes the amino 
acid sequence <SEQ ID 7198>. Analysis of this protein sequence reveals the following: 

Possible site: 35
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2890 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

The protein has no significant homology with any sequences in the GENPEPT database.

No corresponding DNA sequence was identified in S.pyogenes.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2353 

A DNA sequence (GBSx2508) was identified in S.agalactiae <SEQ ID 7199> which encodes the amino 
acid sequence <SEQ ID 7200>. Analysis of this protein sequence reveals the following: 

Possible site: 60
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2410 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

A related GBS nucleic acid sequence <SEQ ID 9329> which encodes amino acid sequence <SEQ ID 9330>
was also identified.

The protein has no significant homology with any sequences in the GENPEPT database.
No corresponding DNA sequence was identified in S.pyogenes.

SEQ ID 9330 (GBS678) was expressed in E.coli as a His-fusion product. SDS-PAGE analysis of total cell
extract is shown in Figure 163 (lane 18; MW 53kDa), Figure 164 (lanes 2 & 3; MW 53kDa) and Figure 188
(lane 7; MW 53kDa). Purified protein is shown in Figure 242, lanes 6 & 7.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2354 

A DNA sequence (GBSx2509) was identified in S.agalactiae <SEQ ID 7201> which encodes the amino 
acid sequence <SEQ ID 7202>. This protein is predicted to be surface protein Pub. Analysis of this protein 
sequence reveals the following: 

Possible site: 24
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2025 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.pyogenes.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2355 

A DNA sequence (GBSx2510) was identified in S.agalactiae <SEQ ID 7203> which encodes the amino 
acid sequence <SEQ ID 7204>. This protein is predicted to be surface protein Rib. Analysis of this protein 
sequence reveals the following: 

Possible site: 24
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.1892 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.pyogenes.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2356 

A DNA sequence (GBSx2511) was identified in S.agalactiae <SEQ ID 7205> which encodes the amino 
acid sequence <SEQ ID 7206>. This protein is predicted to be surface protein Rib. Analysis of this protein 
sequence reveals the following: 

Possible site: 24
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.1892 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.pyogenes.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2357 

A DNA sequence (GBSx2512) was identified in S.agalactiae <SEQ ID 7207> which encodes the amino
acid sequence <SEQ ID 7208>. Analysis of this protein sequence reveals the following:

Possible site: 28
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.0999 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

The protein has no significant homology with any sequences in the GENPEPT database.
No corresponding DNA sequence was identified in S.pyogenes.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2358 

A DNA sequence (GBSx2514) was identified in S.agalactiae <SEQ ID 7209> which encodes the amino
acid sequence <SEQ ID 7210>. This protein is predicted to be surface protein Rib. Analysis of this protein
sequence reveals the following:

Possible site: 24
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.1892 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.pyogenes.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2359 

A DNA sequence (GBSx2515) was identified in S.agalactiae <SEQ ID 7211> which encodes the amino
acid sequence <SEQ ID 7212>. Analysis of this protein sequence reveals the following:

Possible site: 19
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2041 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

The protein has no significant homology with any sequences in the GENPEPT database.
No corresponding DNA sequence was identified in S.pyogenes.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2360 

A DNA sequence (GBSx2516) was identified in S.agalactiae <SEQ ID 7213> which encodes the amino
acid sequence <SEQ ID 7214>. This protein is predicted to be 30S ribosomal protein S6 (rpsF). Analysis of
this protein sequence reveals the following:

Possible site: 51
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.3607 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

A related GBS nucleic acid sequence <SEQ ID 9423> which encodes amino acid sequence <SEQ ID 9424>
was also identified.

The protein has homology with the following sequences in the GENPEPT database.

>GP:CAB16128 GB:Z99124 ribosomal protein S6 (BS9) [Bacillus subtilis]
Identities = 41/72 (56%), Positives = 58/72 (79%), Gaps = 1/72 (1%)

Query: 1 WARFDSILSDNGATVVESKDWEKRRIAYEIQDFTEGLYH^ 60 

++ RF+++L+ NGA + +KDW KRRLAYEI DF +G Y IVNV++ DA A+ EFDRL+K 
Sbjct: 22 VIERFNNVLTSNGAEITGTKDWGKRRLAYEINDFRDGFYQIVNVQS-DAAAVQEFDRIAK 80 

Query: 61 INGDILRHMIVK 72
I+ DI+RH++VK
Sbjct: 81 ISDDIIRHIVVK 92

A related DNA sequence was identified in S.pyogenes <SEQ ID 7215> which encodes the amino acid
sequence <SEQ ID 7216>. Analysis of this protein sequence reveals the following:

Possible site: 40
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2720 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

An alignment of the GAS and GBS proteins is shown below.

Identities = 66/74 (89%), Positives = 70/74 (94%)

Query: 1 WARFDSILSDNGATVVESKDWEKRRIAYEIQDFTEGLYHIVNVEAEDAVALNEFDRLSK 60
+VARFDSIL+DNGATVVESKDWEKRRIAYEI DF EGLYHIVN+EA DA ALNEFDRLSK
Sbjct: 22 LVARFDSILTDNGATVVESKDWEKRRIAYEINDFREGLYHIVNLEATDAAALNEFDRLSK 81

Query: 61 INGDILRHMIVKVD 74
INGDILRHMIVK+D
Sbjct: 82 INGDILRHMIVKLD 95

Based on this analysis, it was predicted that these proteins and their epitopes could be useful antigens for
vaccines or diagnostics.

Example 2361

A DNA sequence (GBSx2518) was identified in S.agalactiae <SEQ ID 7219> which encodes the amino 
acid sequence <SEQ ID 7220>. This protein is predicted to be surface protein Rib. Analysis of this protein 
sequence reveals the following: 

Possible site: 49
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.5289 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.pyogenes.

Based on this analysis, it was predicted that this protein and its epitopes could be useful antigens for
vaccines or diagnostics.

Example 2362 

A DNA sequence (GASx1R) was identified in S.pyogenes <SEQ ID 7221> which encodes the amino acid
sequence <SEQ ID 7222>. Analysis of this protein sequence reveals the following:

Possible site: 33
>>> Seems to have an uncleavable N-term signal seq

Final Results
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2363 

A DNA sequence (GASx5R) was identified in S.pyogenes <SEQ ID 7223> which encodes the amino acid 
sequence <SEQ ID 7224>. Analysis of this protein sequence reveals the following: 

Possible site: 20
>>> Seems to have an uncleavable N-term signal seq

Final Results
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.
The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2364

A DNA sequence (GASx11) was identified in S.pyogenes <SEQ ID 7225> which encodes the amino acid
sequence <SEQ ID 7226>. Analysis of this protein sequence reveals the following:

Possible site: 22
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2614 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.
The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2365 

A DNA sequence (GASx17) was identified in S.pyogenes <SEQ ID 7227> which encodes the amino acid
sequence <SEQ ID 7228>. Analysis of this protein sequence reveals the following:

Possible site: 30
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2849 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.
The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2366 

A DNA sequence (GASx18) was identified in S.pyogenes <SEQ ID 7229> which encodes the amino acid
sequence <SEQ ID 7230>. Analysis of this protein sequence reveals the following:

Possible site: 30
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2099 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2367 

A DNA sequence (GASx34) was identified in S.pyogenes <SEQ ID 7231> which encodes the amino acid
sequence <SEQ ID 7232>. Analysis of this protein sequence reveals the following:

Possible site: 54
>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.0801 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2368 

A DNA sequence (GASx38) was identified in S.pyogenes <SEQ ID 7233> which encodes the amino acid 
sequence <SEQ ID 7234>. Analysis of this protein sequence reveals the following: 
Possible site: 18 

>>> Seems to have an uncleavable N-term signal seq

Final Results
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:CAB12617 GB:Z99108 similar to protein-tyrosine phosphatase
[Bacillus subtilis]
Identities = 57/155 (36%), Positives = 88/155 (56%), Gaps = 12/155 (7%)



Query: 1 MKKVCFVCLGNICRSPMAEFVMKSIVSSDVMMIESRATSDWEHGNPIHSGTQSILK 56
M V FVCLGNICRSPMAE + + + + + +S W GNP H GTQ IL+
Sbjct: 1 MISVLFVCLGNICRSPMAEAIFRDLAAKKGLEGKIKADSAGIGGWHIGNPPHEGTQEILR 60

Query: 57 TYQINYDITKCSKQITITDFNTFDY I IG^E)SDNVICNLKEMSQHQWDSKIYLFRE-- 110
I++D ++Q++ D + FDYII MD++N+ +L+ M+ + S I +
Sbjct: 61 REGISFD-G

Query: 111 -GGVPDPWmiDFEETYQLVRKGCQDWLSRLMSKE 144
VPDP+YT +FEE QL++ GC+ Irf- + ++
Sbjct: 120 LADVPDPYYTGNFEEVCQLIKTGCEQLLASIQKEK 154



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2369

A DNA sequence (GASx42R) was identified in S. pyogenes <SEQ ID 7235> which encodes the amino acid 
sequence <SEQ ID 7236>. Analysis of this protein sequence reveals the following: 

Possible site: 14 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm Certainty=0. 4753 (Affirmative) < suco 

bacterial membrane --- Certainty=0. 0000 (Not Clear) < suco 
bacterial outside Certainty=0. 0000 (Not Clear) < suco 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 
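
The "Final Results" blocks in these examples all follow one line pattern: a compartment name, a certainty value, and a qualitative call. As a minimal sketch, and not part of the original analysis, the following Python reads such a block and reports the compartment with the highest certainty. The names `parse_final_results` and `predicted_location` are hypothetical, and the regular expression deliberately tolerates stray spaces inside the certainty value.

```python
import re

# One localisation line: compartment, certainty value, qualitative call.
# Spaces inside the number (e.g. "0. 4753") are accepted and removed later.
LINE_RE = re.compile(
    r"bacterial\s+(?P<site>cytoplasm|membrane|outside)\b.*?"
    r"Certainty\s*=\s*(?P<value>[0-9][0-9. ]*?)\s*\((?P<call>[^)]*)\)"
)


def parse_final_results(text):
    """Return {compartment: (certainty, call)} for every matching line."""
    results = {}
    for line in text.splitlines():
        match = LINE_RE.search(line)
        if match:
            value = float(match.group("value").replace(" ", ""))
            results[match.group("site")] = (value, match.group("call").strip())
    return results


def predicted_location(results):
    """Return the compartment with the highest certainty.

    If several compartments tie, the first one listed is returned.
    """
    return max(results, key=lambda site: results[site][0])


if __name__ == "__main__":
    block = """\
bacterial cytoplasm --- Certainty=0. 4753 (Affirmative) < succ>
bacterial membrane --- Certainty=0. 0000 (Not Clear) < succ>
bacterial outside --- Certainty=0 . 0000 (Not Clear) < succ>
"""
    parsed = parse_final_results(block)
    print(parsed)
    print(predicted_location(parsed))
```

Returning the first compartment on a tie is an arbitrary choice for this sketch; a block in which every certainty is 0.0000 carries no real prediction and should be handled separately by the caller.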

Example 2370 

A DNA sequence (GASx47R) was identified in S.pyogenes <SEQ ID 7237> which encodes the amino acid 
sequence <SEQ ID 7238>. Analysis of this protein sequence reveals the following: 
Possible site: 58 

>>> Seems to have no N-terminal signal sequence 



Final Results 

bacterial cytoplasm --- Certainty=0.2014 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2371 

A DNA sequence (GASx53R) was identified in S.pyogenes <SEQ ID 7239> which encodes the amino acid 
sequence <SEQ ID 7240>. Analysis of this protein sequence reveals the following: 

Possible site: 45 



Final Results 

bacterial membrane --- Certainty=0.1044 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2372 

A DNA sequence (GASx67R) was identified in S.pyogenes <SEQ ID 7241> which encodes the amino acid
sequence <SEQ ID 7242>. Analysis of this protein sequence reveals the following: 

Possible site: 39 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1610 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2373 

A DNA sequence (GASx75) was identified in S.pyogenes <SEQ ID 7243> which encodes the amino acid 
sequence <SEQ ID 7244>. Analysis of this protein sequence reveals the following: 
Possible site: 31 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2803 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MKVRPSVKPICEYCKVIRRNGRVMVICPTNPKHKQRQG 38
         MKVRPSVKPICEYCKVIRRNGRVMVICP NPKHKQRQG
Sbjct: 1 MKVRPSVKPICEYCKVIRRNGRVMVICPANPKHKQRQG 38

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2374 

A DNA sequence (GASx76) was identified in S.pyogenes <SEQ ID 7245> which encodes the amino acid 
sequence <SEQ ID 7246>. Analysis of this protein sequence reveals the following: 

Possible site: 35 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0824 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAB06824 GB:L47971 ribosomal protein S13 [Bacillus subtilis]
Identities = 86/121 (71%) , Positives = 103/121 (85%) 

Query: 1 MARIAGVDIPNDKRWISLTYVYG1GLATSKK1IAAAGISEDIRVKDLTSDQEDAIRREV 60 

MARIAGVDIP DKRWISLTY++GIG T++++L AG+SED RV+DLT ++ IR + 
Sbjct: 1 MARIAGVDIPRDKRWISLTYIFGIGRTTAQQVLKEAGVSEDTRVRDLTEEELGKIRDI1 60 

Query: 61 DAIfCVEGDLRREVNMNIKRLMEIGSyRGIRHRRGLPVRGQNTKNNARTRKGKAVAIAGKKK 121 

D +KVEGDLRREV++NIKRL+EIGSYRGIRHRRGLPTOGQN+KNNARTRKG +A KKK 
Sbjct: 61 DKLKVEGDLRREVSLNIKRLIEIGSYRGIRHRRGLPVRGQNSKNNARTRKGPRRTVANKKK 121 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
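
The alignment summary lines ("Identities = 86/121 (71%)" and so on) can be checked mechanically. The sketch below is illustrative only: it parses one summary line and compares each printed percentage with the integer-truncated ratio. Truncation is an assumption inferred from the printed figures (for example, 57/155 is 36.77% but is printed as 36%), not a documented rule; a nonzero difference may indicate either an extraction error or a different rounding convention. The function names are hypothetical.

```python
import re

# Matches "Identities = 86/121 (71%)", "Positives = ...", "Gaps = ...".
SUMMARY_RE = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)\s*\(\s*(\d+)\s*%\s*\)"
)


def parse_summary(line):
    """Return {field: (count, length, printed_percent)} for one summary line."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in SUMMARY_RE.findall(line)
    }


def truncated_percent(count, length):
    """Integer percentage with the fractional part discarded (assumed rule)."""
    return count * 100 // length


def percent_differences(line):
    """Return {field: printed_percent - truncated_percent} for one line."""
    return {
        name: printed - truncated_percent(count, length)
        for name, (count, length, printed) in parse_summary(line).items()
    }


if __name__ == "__main__":
    summary = "Identities = 86/121 (71%) , Positives = 103/121 (85%)"
    print(parse_summary(summary))
    print(percent_differences(summary))
```

Reporting a difference rather than a pass/fail flag keeps the check useful even if the truncation assumption turns out not to hold for every field.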

Example 2375 

A DNA sequence (GASx81R) was identified in S.pyogenes <SEQ ID 7247> which encodes the amino acid 
sequence <SEQ ID 7248>. Analysis of this protein sequence reveals the following: 

Possible site: 21 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1842 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2376 

A DNA sequence (GASx82) was identified in S.pyogenes <SEQ ID 7249> which encodes the amino acid 
sequence <SEQ ID 7250>. Analysis of this protein sequence reveals the following: 

Possible site: 59 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3613 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2377 

A DNA sequence (GASx83) was identified in S.pyogenes <SEQ ID 7251> which encodes the amino acid
sequence <SEQ ID 7252>. Analysis of this protein sequence reveals the following:

Possible site: 51 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1141 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2378 

A DNA sequence (GASx85) was identified in S.pyogenes <SEQ ID 7253> which encodes the amino acid
sequence <SEQ ID 7254>. Analysis of this protein sequence reveals the following: 
Possible site: 16 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2280 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2379

A DNA sequence (GASx89R) was identified in S.pyogenes <SEQ ID 7255> which encodes the amino acid
sequence <SEQ ID 7256>. Analysis of this protein sequence reveals the following:

Possible site: 44 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3040 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2380 

A DNA sequence (GASx102) was identified in S.pyogenes <SEQ ID 7257> which encodes the amino acid
sequence <SEQ ID 7258>. Analysis of this protein sequence reveals the following: 

Possible site: 33 

>>> Seems to have an uncleavable N-term signal seq
INTEGRAL Likelihood =-13.75 Transmembrane 21 - 37 ( 12 - 41)

Final Results

bacterial membrane --- Certainty=0.6498 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC45312 GB:U81957 ComYC [Streptococcus gordonii]
Identities = 59/104 (56%) , Positives = 85/104 (81%) , Gaps = 1/104 (0%)

Query: 6  NNLRHKKLKGFTLLEMLLVILVISVLMLLFVPNLSKQKDRVTETGNAAWKLVENC^SELY 65
          N L+ ++K FTL+EML+V+L+ISVLMLLFVPNL+KQK+ V++TGNAAWK+VE+QAELY
Sbjct: 2  NKLKKLRVKAFTLVEMLVVLLIISVLMLLFVPNLTKQK3AVSDTGNAAWro7VESQAELY 61

Query: 66 EL-SQGSKPSLSQLKM3GSITEKQEKAYQDYYDKHKNEKRRIiSN 108
          EL + G + +LS+L A G+1++KQ 4-Y+ YY K+ +E ++N
Sbjct: 62 ELKNTGDQATLSKLVAAGNIEQKQADSYKAYYGKNNSETQAVAN 105

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 
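
Where a transmembrane segment is predicted, the report adds one "INTEGRAL" line per segment. The following sketch, offered only as an illustration, extracts those lines into records and orders them by likelihood (most negative first). Treating the first residue pair as the predicted segment and the parenthesised pair as a wider flanking range is an assumption about the field layout; the `Segment` type and `parse_segments` function are hypothetical names.

```python
import re
from typing import List, NamedTuple


class Segment(NamedTuple):
    likelihood: float   # value printed after "Likelihood ="
    start: int          # first residue pair (assumed: predicted segment)
    end: int
    outer_start: int    # parenthesised pair (assumed: wider flanking range)
    outer_end: int


INTEGRAL_RE = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?\d+\.\d+)\s+Transmembrane\s+"
    r"(\d+)\s*-\s*(\d+)\s*\(\s*(\d+)\s*-\s*(\d+)\s*\)"
)


def parse_segments(text: str) -> List[Segment]:
    """Return all INTEGRAL records in text, most negative likelihood first."""
    segments = [
        Segment(float(lik), int(a), int(b), int(c), int(d))
        for lik, a, b, c, d in INTEGRAL_RE.findall(text)
    ]
    return sorted(segments, key=lambda seg: seg.likelihood)


if __name__ == "__main__":
    report = "INTEGRAL Likelihood =-13.75 Transmembrane 21 - 37 ( 12 - 41)"
    for seg in parse_segments(report):
        print(seg, "length:", seg.end - seg.start + 1)
```

Sorting by likelihood simply puts the segment with the most negative value first; whether that is the right ranking for a given use is left to the reader.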

Example 2381 

A DNA sequence (GASx103) was identified in S.pyogenes <SEQ ID 7259> which encodes the amino acid
sequence <SEQ ID 7260>. Analysis of this protein sequence reveals the following: 

Possible site: 24

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAC23740 GB:AF052207 competence protein [Streptococcus pneumoniae] 
Identities = 52/131 (39%) , Positives = 76/131 (57%)

Query: 8   IKAFTLLETLLSLSVMSFIILGLSVPVTKSYQIO/EEHLFFSHFEHLYRHQQKLAILQQKQ 67
           IKAFT+LE+LL L ++S + LGLS V ++ VEE +FF FE LYR QK ++ Q++
Sbjct: 2   IKAFTMLESLLVLGLVSILALGLSGSVQSTFSAVEEQIFFMEFEELYRETQKRSVASQQK 61

Query: 68  RVLDISSTKIVTEGNSLWPKSITVNHPYRLVIDQMGGNH31AKIIFDMTDRRFKYQFYL 127
           L++ I LTVPK I + D+ GGN SLAK+ F + +YQ YL
Sbjct: 62  TSLNLDGQMISNGSQKLTVPKGIQAPSGQSITFDRAGGNSSIiAKVEFQTSKGAIRYQLYL 121

Query: 128 GSGNYQKTSQS 138

Sbjct: 122 GNGKIKRIKET 132

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2382 

A DNA sequence (GASx104) was identified in S.pyogenes <SEQ ID 7261> which encodes the amino acid
sequence <SEQ ID 7262>. Analysis of this protein sequence reveals the following: 

Possible site: 23

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S. agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2383 

A DNA sequence (GASx109) was identified in S.pyogenes <SEQ ID 7265> which encodes the amino acid
sequence <SEQ ID 7266>. Analysis of this protein sequence reveals the following: 

Possible site: 45

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood =-10.51 Transmembrane 37 - 53 ( 28 - 58)
INTEGRAL Likelihood = -3.56 Transmembrane 61 - 77 ( 60 - 77)

Final Results

bacterial membrane --- Certainty=0.5203 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S. agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2384

A DNA sequence (GASx115R) was identified in S.pyogenes <SEQ ID 7267> which encodes the amino acid
sequence <SEQ ID 7268>. Analysis of this protein sequence reveals the following: 

Possible site: 18

Final Results

bacterial membrane --- Certainty=0.5437 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2385 

A DNA sequence (GASx124) was identified in S.pyogenes <SEQ ID 7269> which encodes the amino acid
sequence <SEQ ID 7270>. Analysis of this protein sequence reveals the following: 

Possible site: 52 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -8.17 Transmembrane 31 - 47 ( 29 - 59)
INTEGRAL Likelihood = -5.63 Transmembrane 737 - 753 ( 734 - 756)

Final Results

bacterial membrane --- Certainty=0.4270 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC97148 GB:U49397 Cpa [Streptococcus pyogenes]
Identities = 401/737 (54%), Positives = 517/737 (69%), Gaps = 25/737 (3%) 







[Alignment rows preceding Sbjct residue 358 are not recoverable from the source; only the following match-line fragments survive.]

S N+KR T+ L+ VFL AL+ + + FG E S PN S +Y WYGY+S
E+ SN ++ QL LMR+ALK+LIDPNL +K + P ++L++FES D
I +EL +GTYTLTE +SP GY 1AEPI F+VE KV+ ^
VL+ Y KFYYA NK+ SSQWYCFNADL SPP

Sbjct: 358 QVENPNKEVMEYSVI^YOTF^EEVLSGFTPYGKFYYATlIKDKSSQVVYCFNADIiHSPP 417

Query: 441 DSEDGGKTMTPDFTT-GEVECYTHIAGRDLFKYTVKPRDTDPDTFLKHIKKVIEKGYREKG 499 
DS D G+T+ PD +T EVKYTH AG DLFKY ++PRDT+P+ FLKHIKKVIEKGY++KG 
Sbjct: 418 DSYDSGETIMPDTSTMKEVKYTHTAGSDLFKYALRPRDTISIPEDFliKHlKKVIEKGYKKKG 477

Query: 500 QAIEYSGLTETQLRAATQIAIYYFTDSAELDKDKL KDYHGFGDMNDSTLAVAKILV 555 

4 Y+GLTETQ RAATQLAIYYFTDSA+L K K YHGF M++ TLAV K L+ 

Sbjct: 478 DS--YNGLTETQFRAATQLAIYYFTDSADjKTLKTYMNGKGYHGFESMDEKTLAVTKELI 535 

Query: 556 EYAQDSNPPQLTDLDFFIPNNNKYQSLIGTQWHPEDLVDIIRMEDKK-EVIPVTHNLTLR 614 

YAQ+ + PQLT4LDFF+PNN+K QSLIGT4 HP+DLVD+IRMEDKK EVIPVTH+LT++ 
Sbjct: 536 TYAQNGSAPQLTNLDFFVPNNSKDQSLIGTECHPDDLVDVIRMEDKKQEVIPVTHSLTVK 595 

Query: 615 KWTGI^GDRTKDFHFEIELKNNKQELLSQTVKTDKTNLEFKIJGKATIMLKHGESLTLC^ 674

KTV G GD+TK F FE+ELK+ + + T+KT+ +h KDGK 4 NLKHG444 ++G 
Sbjct: 596 KTWGELGDKTKGFQFELELKDKTGQPIVNTLKTNNQDLVAKDGKYSFNLICHGDTIRIEG 655 

Query: 675 LPEGYSYLVKETDSEGYKVKVNSQEVANATVSKTGITSDETLAFENNKEPWPTGVDQKI 734 
LP GYSY 4KB +4+ Y V V444 A IT D4 4 FEN K4 V PTG4

Sbjct: 656 LPTGYSYTLKEAEAKDYIVTVDNKVSQEAQSVGKDITEDKKVTFENRKDLVPPTGLTTDG 715 

Query: 735 NGYLALIVIAGISLGIW 751 
YL L444 4 L 4W 
Sbjct: 716 AIYLWLLLLVPLGLLVW 732

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2386 

A DNA sequence (GASx125R) was identified in S.pyogenes <SEQ ID 7271> which encodes the amino acid
sequence <SEQ ID 7272>. Analysis of this protein sequence reveals the following: 

Possible site: 40 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2604 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2387

A DNA sequence (GASx126) was identified in S.pyogenes <SEQ ID 7273> which encodes the amino acid
sequence <SEQ ID 7274>. Analysis of this protein sequence reveals the following: 

Possible site: 14

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1537 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAC97149 GB:U49397 LepA [Streptococcus pyogenes] 
Identities = 59/132 (44%) , Positives = 84/132 (62%) , Gaps = 5/132 (3%) 

Query: 1   MIIKPJSTOMRPSVKAGDAILFYR^QTYroffiEAVVYEDSKTSITKVGRIIAQAGDEVDLTE 60
           MII NDM+P++ AGD +L+YRL+ + + WYE T KVGRI AQAGDEV+ T+
Sbjct: 42  MIIimroMSPALSAGDGVI,YYRIADRSHI!raVWYEVDMr--LKVGRIAAQAGDEVlIFTQ 99

Query: 61  QGELKINGHI QNEG LTFI KBREAOTPYRIADNS YLILNDYYSQESENYLQDAIAKDA 117
           +G L INGH + LT+ S N+PY++ +Y IUNDY + ++ A+ +
Sbjct: 100 EGGLLIMGHPPEKEVPYLTYPHSSGPKFPYKVPTGTYFILNDYREERLDSRYYGALPIHQ 159

Query: 118 IKGTINTLIRLR 129
           IKG I+TL+R+R
Sbjct: 160 IKGKISTLLRVR 171

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2388 

A DNA sequence (GASx127) was identified in S.pyogenes <SEQ ID 7275> which encodes the amino acid
sequence <SEQ ID 7276>. Analysis of this protein sequence reveals the following: 
Possible site: 17

Final Results

bacterial membrane --- Certainty=0.2572 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MKLRHLLLTGAALTSFA ATTVHGET- - VVNGAKLTVTKNL-DLVNSNALIPNTDF 52 

MK LLL A L + + + ET V++G+ L V K + N L+P D+ 

Sbjct: 1 MKKNKLLIATAILATALGMASMSQNIKAETAGVIDGSTLVVKKTFPSYTDDNVLMPKADY 60

Query: 53 TFKTEPDTTVN EDGNKFK-GVALKTPMTK-VTYTNSDKGGSNTKTAEFDFSEVTFEK 107 

+FK+E D +DG K GV TK + Y+NSDK + K+ F+F+ V F 

Sbjct: 61 SFKVEADDNAKGKTKDGLDIKPGVIDGLENTKTIRYSNSDKITAKEKSVNFEFANVKFPG 120 

Query: 108 PGVYYYKOTEEKIDKVPGVSYDTTS YTVQVHVLWNEEQQKPVATYIVGYKEGS - - KVPIQ 165

GVY Y V S +K G++YD+ +TV V+V+ N+E YIV + G K P+ 

Sbjct: 121 VGWRYWAEVNGNKA-GITYDSQQWTVDVYW-NKEGGGFEViaiVSTEVGQSEKXPVL 178 

Query: 166 FKNSLDSTTLTVXKKVSGTGGDRSKDFNFGLTLKANQYYKASEKVMIEKTTKGGQAPVQT 225 

FKNS D+T+L ++K+V+G G+ + F+F L L N+ + EK + +GG+ 

Sbjct: 179 FKNSFDTTSLKIEKQVTGNTGEHQRLFSFTLLLTPMECF EKGQWNILQGGETK 232 

Query: 226 EASIDQLYHFTLKDGESIKVTNLPVGTOYV^/TEDDYKSEKYTTTTVEVSPQDGAVKNIAGN 285 

+ I + Y FTLKD S+ ++ LPVG++Y +TE+D + Y T+ + + + G 
Sbjct: 233 KWIGEEYSFTLKDKGSVTLSQLPVGIEYKLTEEDVTKDGYKTSATLKDGEQSSTYELGK 292 



Query: 286 STEQETSTDKDiyrriTFTNKmFEVPTGVAMOTAPYIAMIVAVGGALYFViaaCIJA 340 
+ + S D+ I TNK+D +VPTGV T+AP+ L IVA+GG +Y K+K A 
Sbjct: 293 DHKTDKSADE IVVTNKRDTQVPTGWGTLAPFA\TjSIVAIGGVIYITKRKKA 344

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2389 

A DNA sequence (GASx128) was identified in S.pyogenes <SEQ ID 7277> which encodes the amino acid
sequence <SEQ ID 7278>. Analysis of this protein sequence reveals the following: 

Possible site: 44 

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC97152 GB:U49397 unknown [Streptococcus pyogenes] 
Identities = 115/240 (47%) , Positives = 178/240 (73%) , Gaps = 3/240 (1%) 

Query: 1 MITOLIKLLDKLI WIVLCFFFLCLLIAALGIYDALTWC^ANATNYQQyKKKGVQ- - FD 58 

M++ ++++++K 1+ ++L F + L +A G++D+ +YQ A+A+N++++K Q F+ 
Sbjct: 351 MMMTIVQVINKAIDTLILIFCLWLFLAGFGLWDSYHLYQQADASNFKKFKTAQQQPKFE 410 

Query: 59 DLIAINSDVMAWLTVKGTHIDYPlVQGENNLEYINKSVEGEYStjSGSVFLDYRNKVTFED 118 

DLLA+N DV+ WL + GTHIDYP+VQG+ NLEYINK+V+G ++SGS+FLD RN F D 
Sbjct: 411 DLIALOTDVIGWimPGTHIOTPLVC^KTOTi^^ 470 

Query: 119 KJSLiIYAHHMAGNVMFGEIiPNFRKKSFFNJCHICEFSIETKTKQKIiKINIFACIQTDAFDSL 178 

YSLIY HHMAGN MFGE+P F KK+FFNKH + IETK ++KL + IFAC++TDAFD L 
Sbjct: 471 DYSLIYGHHMAGNAMFGEIPKFLKKNFFNKHNKAIIETKERKKLTVTIFACLKTDAFDQL 530 

Query: 179 LFNPIDV-DISSKNEFLNHIKQKSVQYREILTTNESRFVALSTCEDMTTDGRIIVIGQIE 237 

+FNP + + 4 + ++S Q++ + + ++FVA STCE+ +TD R+IV+G 1+ 

Sbjct: 531 VFNPNAITNQDQQRQLVDYISKRSKQFKPVKLKHHTKFVAFSTCENFSTDNRVIWGTIQ 590 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2390 

A DNA sequence (GASx129) was identified in S.pyogenes <SEQ ID 7279> which encodes the amino acid
sequence <SEQ ID 7280>. Analysis of this protein sequence reveals the following: 

Possible site: 26 

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -6.05 Transmembrane 5 - 21 ( 4 - 22)
INTEGRAL Likelihood = -5.04 Transmembrane 191 - 207 ( 186 - 209)

Final Results

bacterial membrane --- Certainty=0.3421 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

LPXTG motif: 181-186

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC97151 GB:U49397 unknown [Streptococcus pyogenes] 
Identities = 64/213 (30%) , Positives = 106/213 (49%) , Gaps = 20/213 (9%) 

Query: 1   MKKSILRILaiGYLLMSFCIiLDSVEAENLTASINIEVINQraVATNKQSSDIDETFMFVI 60
           M+K + ++ +L +V A++ T +IVN++A+ F +
Sbjct: 1   MRKYWKMLFSVWMLTMIAraQTVLAKDSTVQTSISVENVLERAGDSTP FSIAL 54

Query: 61  EALDKESPLPNSVTTSVKGNGICrSFEQLTFSEVGQYHYKIHQLLGKNSQYHYDETVYEvV 120
           E++D + ++ G+GK SF L F+ VGQY Y+++Q +N Y D TV++V+
Sbjct: 55  ESIDAMKTIEE- - - ITIAGSGKASFSPLNFTTVGQYTYRVYQKPSQNKDYQADTTVFDVL 111

Query: 121 IYVLYNEQSGALETNLVSNKLGETEKSELIFKQEYSEKTPEPHQPDTTEKEKPQKKRNGI 180
           +YV Y+E G L ++S + G+ EKS + FK + K P QPD +
Sbjct: 112 VYVTYDE-DGTLVAKVISRRAGDEEKSAITFKPKRLVKPIPPRQPDIPKTP 161

Query: 181 LPSTGEMVSYVSALGIVLVATITLYSIYKKLKT 213
           LP GE+ S + L IVL+ + L + KKLK+
Sbjct: 162 IiPLAGEVKSLLGILS I VLLGLLVLLYV- KKLKS 193

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2391 

A DNA sequence (GASx130R) was identified in S.pyogenes <SEQ ID 7281> which encodes the amino acid
sequence <SEQ ID 7282>. Analysis of this protein sequence reveals the following: 
Possible site: 57

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1614 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB54046 GB:AJ245436 hypothetical protein, 57.8 kD [Pseudomonas putida]
Identities = 128/388 (32%) , Positives = 204/388 (51%) , Gaps = 21/388 (5%)

Query:     IGSWQRQELVFIPAQLKRINHVQHAYKCQTCSDNSLSDKIIXAPVPKAPLAHSLGSASI 63
           IG V Q L +P Q++ I HV+ Y C+ C ++ A P + S+ S S+
Sbjct: 126 IGEEVSEQ-LEIVPMQIRVIKHVRKVYGCRDCESAPVT ADKPAQMIEKSMASPSV 179

Query: 64  IAHTVHQKFTLKVPNYRQEEDWNKLGLS ISRKE IANWHI KSSQYYFEPLYDLLRDILLSQ 123
           +A + K+ +P +R E+ + G+ I R+ +A W 1+ S++ F+PL +L+R+ LL+
Sbjct: 180 IiAMLLTTKYVDGLPLHRFEKVLGRHGIDIPROTLA^ 238

Query: 124 EVIHADETSYRVLESD TQIiTYYWTFLSGKHEKKGITIjYHHDKRRSGLVTQEVLGDY 179
           +IH DET +VL+ + ++ W G ++ + L+ + R+ V +L Y

           h +GCWAH RRKF EA
Sbjct: 358 GVERDLKDSDDEDRKVARMERSLPLLTQLKNVA7EKTQEQVTTQNALGKA.IGYIaRS1OTSKL 417

Query: 294 RTVLEDGHIVIiSNNMAERAIKSLVMGRKIWnjFSQSFEGAKAAAI^ 353
           +E G++ + NN AERAI+ V+GRKNWLFS + +GA A+A + SL+ETAK +G
Sbjct: 418 ERYVEHGYLPMDNWAAERAIRPFVIGRKNWLFSDTPKGATASAQLYSLVETAKANGQEPY 477

Query: 354 KYI S YLLDRLPNEETLAKRE VLEAYLPW 381
           ++ + L+RLP ++ E EA LPW
Sbjct: 478 AWLRHALERLPQACSV EDYEALLPW 502

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2392 

A DNA sequence (GASx131R) was identified in S.pyogenes <SEQ ID 7283> which encodes the amino acid
sequence <SEQ ID 7284>. Analysis of this protein sequence reveals the following:

Possible site: 37 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4465 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2393 

A DNA sequence (GASx132R) was identified in S.pyogenes <SEQ ID 7285> which encodes the amino acid
sequence <SEQ ID 7286>. Analysis of this protein sequence reveals the following: 

Possible site: 46 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1529 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAA84885 GB:AB024946 orf50 [Escherichia coli] 
Identities = 37/91 (40%), Positives = 53/91 (57%) 

Query: 10 QVYLVCGKTD^QGIDSIAYLVKSQKELDLFSGAVYLFCGGRPJJRFKALYWDGQGFWLLY 69 

+++LV G TDMR G + LA V++ + D FSG +++F G R D+ K L+ D G L 
Sbjct: 9 RIWLVAGITDMRNGFNGl^SKVQNVLKDDPFSGHLFIFRGRRGDQIKVLWADSDGLCIiFT 68 

Query: 70 KRFENGKLAWPRNRDEVKCLTAVQVDWLMKG 100

KR E G+ WP RD LT Q+ L++G 
Sbjct: 69 KRLERGRFVWPVTRDGKVHLTPAQLSMLLEG 99

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2394 

A DNA sequence (GASx133R) was identified in S.pyogenes <SEQ ID 7287> which encodes the amino acid
sequence <SEQ ID 7288>. Analysis of this protein sequence reveals the following:

Possible site: 18

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1979 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2395 

A DNA sequence (GASx135R) was identified in S.pyogenes <SEQ ID 7289> which encodes the amino acid
sequence <SEQ ID 7290>. Analysis of this protein sequence reveals the following: 

Possible site: 20 

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2396

A DNA sequence (GASx136) was identified in S.pyogenes <SEQ ID 7291> which encodes the amino acid
sequence <SEQ ID 7292>. Analysis of this protein sequence reveals the following: 

Possible site: 54 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood =-11.73 Transmembrane 222 - 238 ( 212 - 242)
INTEGRAL Likelihood =-10.88 Transmembrane 37 - 53 ( 32 - 57)
INTEGRAL Likelihood = -9.87 Transmembrane 462 - 478 ( 456 - 478)
INTEGRAL Likelihood = -4.25 Transmembrane 119 - 135 ( 117 - 137)
INTEGRAL Likelihood = -2.60 Transmembrane 308 - 324 ( 306 - 324)
INTEGRAL Likelihood = -1.28 Transmembrane 164 - 180 ( 164 - 180)
INTEGRAL Likelihood = -0.06 Transmembrane 137 - 153 ( 137 - 153)
INTEGRAL Likelihood = -0.06 Transmembrane 343 - 359 ( 343 - 359)

Final Results

bacterial membrane --- Certainty=0.5692 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB04077 GB:AP001508 short-chain fatty acids transporter [Bacillus halodurans]
Identities = 158/465 (33%), Positives = 248/465 (52%), Gaps = 41/465 (8%) 

Query:     5TEKTGGI I YGWVN 74
           T T 1+ W
Sbjct: 5   I SLSNRLMQRYL PDPFLFWLLTFLVFALSLIFTES TPLT- - IVQYWGE 51

Query: 75  GFWGLLSFAMQMTILLATGNAVASSPPAHKMFKSLAKLPQTRTQIFIFSIWGSIFGFLH 134
           GFWGLLSF+MQM ++L TG+ +ASSP K +LA LP + Q + W + F++

           WG G+++ +F KEL +K V L +A+ + F+ GLSG+ L ATPD+

Query:     ! 31 , T Li ' IGQC ! ?"a Vi '-X - J VMSSWHI TjQFPF G I FGI IQSTGLGLEI 368
           N LFL LG+L G P+ + V + G+++QFPFYAG+ GI+ S+GL +
Sbjct: 273 VNSLFLFLGILFHGT- - PIO.FLKAVTSAVKGASGIIIQFPFYAGLMGIMVSSGLATVM 330

Query: 369 !HFFVAISNGTTWPVFAYLYSALLNIAVPSGGSKFi7IEAPYIVPATIEVGNDLGKILQAY 428
           3 FV+ SN T+P+F +L + ++N+ VPSGG ++ ++AP ++ A +G K A
Sbjct: 331 3EAFVSFSNEVTFPLFVFLSAGIV1WFVPSGGGQWAVQAPWLEAAQSLGVPAAKAAMAV 390

Query: 429 JLGDATTNLIVPFWALSYLSNFKLKFNQIVAYTIPCVLWTGIAI 473
           GDA TN+I PFWAL L+ LK 1+ + + +LW+G+ I
Sbjct: 391 WGDAWTNMIQPFWALPAIiAIAGLKAKDIMGFCV-MILVVSGVVI 434

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2397 

A DNA sequence (GASx137R) was identified in S.pyogenes <SEQ ID 7293> which encodes the amino acid
sequence <SEQ ID 7294>. Analysis of this protein sequence reveals the following: 

Possible site: 58 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2591 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAC22434 GB:U32761 transcriptional regulator [Haemophilus influenzae Rd]
Identities = 37/107 (34%), Positives = 56/107 (51%), Gaps = 1/107 (0%)

Query: 21 LHRQNLVTFDKTFMINHQLTTLFEEANSLPWKCySASWDFLLNCTRYS-SYLTIIjPRPI 79 

LH+Q + FD+TFMI+H L FE H P + S+ WDFLL+ + + LTILP P+ 
Sbjct: 205 LHQQK^IFDQTFMIHHHLKEAFERNNCYPDIVLDiSSCWDFLLSAVKTNKELLTILPLPM 264 

Query: 80 THFAHmGLVEVQLTEHPKlEVVI^LKHNKTSHLKHYIKHTILDYF 126 

H + ++ W+V L + +HL+ YI +L+ F 

Sbjct: 265 AELYHSKEFLCRKIESPVPWKVTLCRQRKTVYTHLEEYIFDKLLEAF 311 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2398 

A DNA sequence (GASx140) was identified in S.pyogenes <SEQ ID 7295> which encodes the amino acid
sequence <SEQ ID 7296>. Analysis of this protein sequence reveals the following: 
Possible site: 50 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3351 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 22 ENKRIAIAEAISHIKDGDTIIWGGFMANGTPEALIDALVDKGTKDLTL1CNDAGFVDRGV 81 

+ K + + +A +DG TIMVGGFM GTP L++AL++ G +DLTLI ND FVD G+
Sbjct: 2 KTKLMTLQDATGFFRDGMTIMVGGFMGIGTPSRLVEALLESGVRDLTLIANDTAFVDTGI 61 

Query: 82 GKMVANHQFKTIYATHIGUSTKEAGRQMTAGETTIELIPQGTFAEKIRIGAYGIGGFYTPT 141 

G ++ N + + + A+HIG N E GR+M +GE + L+PQGT E+IR G G+GGF TPT 
Sbjct: 62 GPLIVNGRVRKVIASHIGTNPETGRRKIISGEMDV\rLVPQGTIjIEQIRCGGAGLGGFLTPT 121 

Query: 142 GVGTLVAEGKETKTIKGKTYLLEYPFEADVALIFANQADEMGNLQYSGSENNFNQLMAAC 201

GVGT+V EGK+T T+ GKT+LLE P AD+ALI A++ D +GNL Y S NFN L+A 
Sbjct: 122 GVGTvvEKGKQTLTLDGKTWLLERPbRADLALIRAHRCDTLGNLTYQLSARNFNPLIALA 181 

Query: 202 AKTTIVQAREIVPVGTIQPECVHTPHIFVDYIV 234 

A T+V+ E+V G +QP+ + TP +D+I+ 
Sbjct: 182 ADITLVEPDELVETGELQPDHIVTPGAVIDHII 214 

subunit (EC 2.8.3.-). [Escherichia coli] 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2399 

A DNA sequence (GASx141) was identified in S.pyogenes <SEQ ID 7297> which encodes the amino acid
sequence <SEQ ID 7298>. Analysis of this protein sequence reveals the following: 

Possible site: 41

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4941 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAF12248 GB:AE001862 CoA transferase, subunit B [Deinococcus radiodurans]
Identities = 114/203 (56%), Positives = 158/203 (77%), Gaps = 3/203 (1%)



GMGGAMDL+ G ++V+V MEH KG HK+L+ECTLPLT +GWD IIT++GV VTP G

Query: 11
Sbjct: 5

Query: 69
Sbjct:

Query: 129
Sbjct: 125

Query: 188
Sbjct: 185

++++E++ G T DE++ TG



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2400 

A DNA sequence (GASx144) was identified in S.pyogenes <SEQ ID 7299> which encodes the amino acid
sequence <SEQ ID 7300>. Analysis of this protein sequence reveals the following: 

Possible site: 39 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3227 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAA29948 GB:AP000003 137aa long hypothetical protein [Pyrococcus 
horikoshii] 

Identities = 49/113 (43%), Positives = 71/113 (62%), Gaps = 1/113 (0%)

Query: 5 PEPMGPySTyTIEGHFLYTAGQLPLNPVTGQLSDG-FEAQCRQVFVNLQSILAEQKLDLN 63 

P+P+GPYS G+FL+ AGQ+P++P TG++ G + Q RQV N+++IL LN 

Sbjct: 22 PKPIGPySQAIKAGNFLFIAGQIPIDPKTGErVKGDIKDQTRQVLENIKAILEAAGYSLN 81 

Query: 64 HIYKIiNvYLTDvTTSrVEILNHVMTDLFEEPYPvRTAVQVSALPLQALIEVEAVA 116 

+ K+ VYL D4-+ +NV+FEPR AV+VS LP LIE+EA+A 
Sbjct: 82 DVIKVTVYLKDMNDFAKMNEVYAEYJGESKPARVA^VSRLPKDVLIEIEAIA 134 



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2401

A DNA sequence (GASx146) was identified in S.pyogenes <SEQ ID 7301> which encodes the amino acid
sequence <SEQ ID 7302>. Analysis of this protein sequence reveals the following: 

Possible site: 16

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1238 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2402 

A DNA sequence (GASx147) was identified in S.pyogenes <SEQ ID 7303> which encodes the amino acid
sequence <SEQ ID 7304>. Analysis of this protein sequence reveals the following: 

Possible site: 30

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -11.46 Transmembrane 456 - 472 ( 452 - 481)

INTEGRAL Likelihood = -8.17 Transmembrane 603 - 619 ( 595 - 623)

INTEGRAL Likelihood = -6.85 Transmembrane 495 - 511 ( 491 - 518)

INTEGRAL Likelihood = -5.31 Transmembrane 420 - 436 ( 418 - 443)

INTEGRAL Likelihood = -4.99 Transmembrane 396 - 412 ( 392 - 413)

INTEGRAL Likelihood = -1.59 Transmembrane 522 - 538 ( 522 - 538)

INTEGRAL Likelihood = -0.64 Transmembrane 577 - 593 ( 577 - 593)

INTEGRAL Likelihood = -0.43 Transmembrane 377 - 393 ( 377 - 393)

Final Results

bacterial membrane --- Certainty=0.5585 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MAISQMKKLAMVFEKDYLDLVLKTLQQSQLVEVRDMKQ^KH- - - WQDAFNKGNVKLPQIV 57 

MA+++M+K+ ++ +K ++4-L+ +Q VE4RD+ Q W + F P+++ 
Sbjct: 1 MAVTKMEKVTLISDKKNREILLQAVQGLHAVEIRDLFQESENNQWVETF FPEPEMI 56 

Query: 58 QYDLTHQKPLLDDEALQYLLQSQQELENGLASLSAFLPPIGKLTALRQ--KTPSLSFKQF 115 

D K L Y L + + F+ G+ + +Q K LS 

Sbjct: 57 DKDKELAK LSYKLTD IRTAIQFIEHHGEKSQKKQHLKRRELSLDTL 102 

Query: 116 EERHRQQAAQTALKMMSQKIERLEQLQSKIDQLTEYCQSLEKWRSLTVLPQDLAQFHFLS 175 

E-t- + ++A L+ + E+ EQL + QL + L W++L + P+ 
Sbjct: 103 EKNYSEEAFSKKLEEVLLLKEQWEQLVDERQQLEDQENWLLNWQNLDLAPKAFDS-QMTK 161 

Query: 176 ARVGTIPSTANNHFYHQLKQHKGLFIEEVYH TEFEYGLVLFWQAQDTIHLQKYQFK 231 

+GT+ + F ++ + ++EE+ T F Y ++ +++ +Y F 

Sbjct: 162 LVIGTVNAKNAESFKAEVAEINEAYLEEINSSPTTTYFAYIVLRADESRMEEIASRYGFV 221

Query: 232 PLLYKEQLLPSEQLRINKELLTNWLAEIOSLLKELRQSQKIIl^QLQ'VEIDYVLSQYQRQQ 291

Y + P +QL K+ L ++ L + + + + L++ +R+

Sbjct: 222 KEDYLYEGTPQQQLVAAKQSLQEIKDQQKKLSSAIGACSGYIKDFEWTEEIFLARSEREA 281

Query: 292 TKKQLLGTRHLIALEGWIEADSVNQLKGIiMTKTLGDMFYLDSYDVTPDDW- -EDVPIKLR 349 

K +++ T +LI ++GW++ + +L ++ L ++D D+ E+VP KL+ 

Sbjct: 282 IKDRIIHTPYLILIQGWVDHEEKQELIHMI^IL^EEVYLTFDEPTDMEIAEEVPTKLK 341 

Query: 350 NHRYIAPFELVTEMYALPKYQEKDPTPFLAPLYLTFFGlyMVADIX3YGLLLYAVTIjAALVF 409 

WH +APFE+4TEMY+LPKY+E DPTP++' P YL FFGMMVAD+GYGLL++ 
Sbjct: 342 NHPIVAPFEMLTE^SLPKYEEVDPTPWMMPFYLVFFGMMVADIGYGLLMFLGAFLLQKL 401

Query: 410 FNLQKTSKRLVTFFNILAISVAIWGLIYGSFFG-- --FDLPVALLSTKTDVITIL 460 

L + 4R FF ILAI IWG IY SFFG LP +LST DV TIL 

Sbjct: 402 WLPRGMQRFAKFFEILAIPSIIWGFIYSSFFGAALPKEIFGIHLPFPILSTTDDVNTIL 451 

Query: 461 WSLLFGFVTLIFGLLLGAWQQVRMKAYATAYTSSLAWTFILLGLLLFILGKNVSGLAYJj 520 

++S++FG + ++ GL + A + ++ KAY A AW +ILLG++L +LG 

Sbjct: 462 ILSVIFGLIQILVGLFIAAKEHIKRKAYVDAVNDGFAWQWILLGIILILLGTMTLKNNAF 521 

Query: 521 SVIGKWLALGNAFGILWSLLKSKSLL-GLGSGLYNLYGISSYLSDLVSFTRLMRLGLSG 579 

+G LA+ +A IL++ + +S S G+ G YNLYG++ Y+ DLVS+TRLMALG+SG 
Sbjct: 522 VYLGGALAVLSAVC1LIIPVFQSSSKAKGIAKGAYNLYGLTGYIGDLVSYTRLMALGISG 581 

Query: 580 ASIGAAFNMIVGIFPPVTRFTVGIFIFILLHAINIFLSMLSGYVHGARLIFVEFFGKFYE 639 

SI AAFNM+V PP RF+VGI + I+L A+N+FL++LS YVHGARL +VEFFGKFY 
Sbjct: 582 GS IAAAFNMLVAFMPPAARFSVGILLI IVLQALNMFLTLLSAYVHGARLQYVEFFGKFYT 641 

Query: 640 GGGKAFNPLIOjADNYVNVNEE 660 

GGG++F PLK + YVN+N + 
Sbjct: 642 GGGRSFKPLKTVEKYVNINHK 662 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2403 

A DNA sequence (GASx148) was identified in S.pyogenes <SEQ ID 7305> which encodes the amino acid
sequence <SEQ ID 7306>. Analysis of this protein sequence reveals the following: 

Possible site: 40 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -7.80 Transmembrane 28 - 44 ( 21 - 51) 
INTEGRAL Likelihood = -6.85 Transmembrane 148 - 164 ( 146 - 170) 
INTEGRAL Likelihood = -2.81 Transmembrane 105 - 121 ( 105 - 123) 

Final Results 

bacterial membrane --- Certainty=0.4121 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAA03841 GB:D16334 Na+-ATPase K subunit [Enterococcus hirae] 
Identities = 85/150 (56%), Positives = 107/150 (70%) 

Query: 20 HYFTAHGGVFFAALGIVLAVALSGMGSAYGVGKGGQAAAALLKEEPEKFTSALILQLLPG 79 

+ T +GG+ FA L + A SG+GSA GVG G+AAAAL +PEKF ALILQLLPG 
Sbjct: 4 YLITQNGGMVFAVLAMATATIFSGIGSAKGVGMTGEAAAALTTSQPEKFGQALILQLLPG 63



Query: 80 SQGIYGFAIGILIWMKLTPELSVNQGLAYFLVSLPIAIVGYFSAKHQGNVSVAGMQILAK 139

+QG+YGF I LI++ L ++SV QGL + SLPIA G FS QG V+ AG+QILAK
Sbjct: 64 TQGLYGFVIAFLIFINLGSDMSWQG1^FLGASLPTAFT3LFSGIAQGKVAAAGIQILAK 123

Query: 140 RPKDFMKGVILAAMVETYAIIAFWSFIIiL 169

+P+ KG+I AAMVETYAIL FV+SF+L+
Sbjct: 124 KPEHATKGIIFAAMVETYAILGFVISFLLV 153

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
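
The INTEGRAL lines in the listing above follow a fixed layout (likelihood score, core transmembrane span, extended span in parentheses). As a minimal sketch, assuming whitespace has already been normalized, they could be read into records as follows; the `Segment` type and `parse_integral` helper are illustrative names, not part of the original analysis.

```python
import re
from typing import NamedTuple


class Segment(NamedTuple):
    """One predicted transmembrane segment (hypothetical record type)."""
    likelihood: float
    start: int
    end: int
    ext_start: int
    ext_end: int


# Pattern for lines such as:
# INTEGRAL Likelihood = -7.80 Transmembrane 28 - 44 ( 21 - 51)
INTEGRAL_RE = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?\d+(?:\.\d+)?)\s+Transmembrane\s+"
    r"(\d+)\s*-\s*(\d+)\s*\(\s*(\d+)\s*-\s*(\d+)\s*\)"
)


def parse_integral(text):
    """Return every INTEGRAL line found in text as a Segment."""
    segments = []
    for m in INTEGRAL_RE.finditer(text):
        lik, a, b, c, d = m.groups()
        segments.append(Segment(float(lik), int(a), int(b), int(c), int(d)))
    return segments


report = """\
INTEGRAL Likelihood = -7.80 Transmembrane 28 - 44 ( 21 - 51)
INTEGRAL Likelihood = -6.85 Transmembrane 148 - 164 ( 146 - 170)
INTEGRAL Likelihood = -2.81 Transmembrane 105 - 121 ( 105 - 123)
"""
segments = parse_integral(report)
for seg in segments:
    # Length of the core span, counted inclusively.
    print(seg, seg.end - seg.start + 1)
```

In the three lines used here, each core span covers 17 residues, while the extended span in parentheses varies in length.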

Example 2404 

A DNA sequence (GASx149) was identified in S.pyogenes <SEQ ID 7307> which encodes the amino acid
sequence <SEQ ID 7308>. Analysis of this protein sequence reveals the following: 

Possible site: 55 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4510 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 





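Query: 1
Sbjct: 3

Query: 61
Sbjct: 63

Query: 120
Sbjct: 123

Query: 179
Sbjct: 183

h K Q +Q +L KQ+ ++ LF + ++ N KEE+LA +KQ++

N+L+ +L++ +

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.
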
Example 2405 

A DNA sequence (GASx150) was identified in S.pyogenes <SEQ ID 7309> which encodes the amino acid
sequence <SEQ ID 7310>. Analysis of this protein sequence reveals the following: 
Possible site: 30 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3095 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAA04273 GB:D17462 Na+-ATPase subunit C [Enterococcus hirae]
Identities = 94/326 (28%), Positives = 167/326 (50%), Gaps = 5/326 (1%)

Query: 6 ELNTTISVKEKELLTKEQFDKLLQAPOTTTLARLLHQSVYHLTVDDLNDLDRLESILMAE 65
ELN I +E EL++K+ F++++Q + +h +L ++Y + D D D E+ L E
Sbjct: 5 ELNPLIRGRELELISKDTFEQMIQTDSIDSLGEILQSTIYQPYIYDGFDKD-FEANLSQE 63

Query: 66 LTKTYRWAFAETPQPDIVQLFTLRYTYHWKVLLKAKASQADIiSHLLLPIGDKPLVALEH 125
+K ++W P+P+IV ++T+RYT+HN+KVL KA+ + +L HL + G L L+
Sbjct: 64 RSKLFQWLKESAPEPEIVWIYTMRYTFHNLKVLTKAEI^GQNLDHLYIHDGFYSLEVLKD 123

Query: 126
Sbjct: 124

Query: 186
Sbjct:

Query: 246
Sbjct: 243

Query: 306
Sbjct:


Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2406 

A DNA sequence (GASx151) was identified in S.pyogenes <SEQ ID 7311> which encodes the amino acid
sequence <SEQ ID 7312>. Analysis of this protein sequence reveals the following: 

Possible site: 29

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0484 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:BAA04274 GB:D17462 Na+-ATPase subunit G [Enterococcus hirae]
Identities = 45/101 (44%), Positives = 65/101 (63%)

Query: 6 YKVGVIGNRDVILPFQMIGFQTFPVIKPQDAINQLRQLMEDFGIIYITEDIAAAIPEAL 65
YK+GV+G++D + PF++ GF + + ++A ++G+IYITE A +PE +

Sbjct: 3 YKIGVVGDKDSVSPFRLFGFDVQHGTTKTEIRKTIDEMAKNEYGVIYITEQCANLVPETI 62

Query: 66 THYDNQVLPAVIPLPTHQGAQGIGLSRIQAMVEKAVGQNIL 106
Y Q+ PA+I +P+HQG GIGL IQ VEKAVGQNIL
Sbjct: 63 ERYKGQLTPAIILIPSHQGTLGIGLEEIQNSVEKAVGQNIL 103

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2407 

A DNA sequence (GASx152R) was identified in S.pyogenes <SEQ ID 7313> which encodes the amino acid
sequence <SEQ ID 7314>. Analysis of this protein sequence reveals the following: 

Possible site: 21 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1043 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
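
Each "Final Results" block above lists three candidate compartments with a certainty score, and the compartment marked "Affirmative" carries the highest score. The following is a minimal sketch of how such a block could be read programmatically, assuming the lines have been cleaned of spacing errors; `parse_final_results` and `predicted_location` are hypothetical helper names introduced only for illustration.

```python
import re

# Pattern for lines such as:
# bacterial cytoplasm --- Certainty=0.1043 (Affirmative) < succ>
RESULT_RE = re.compile(
    r"bacterial\s+(cytoplasm|membrane|outside)\W*"
    r"Certainty\s*=\s*([0-9.]+)\s*\(([^)]*)\)"
)


def parse_final_results(text):
    """Map each compartment to a (certainty, verdict) pair."""
    return {
        loc: (float(score), verdict.strip())
        for loc, score, verdict in RESULT_RE.findall(text)
    }


def predicted_location(results):
    """Return the compartment with the highest certainty score."""
    return max(results, key=lambda loc: results[loc][0])


report = """\
bacterial cytoplasm --- Certainty=0.1043 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
"""
results = parse_final_results(report)
print(predicted_location(results), results)
```

Note that a block in which all three scores are 0.0000 has no meaningful maximum, so a caller would need to treat that case separately.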

Example 2408 

A DNA sequence (GASx156) was identified in S.pyogenes <SEQ ID 7315> which encodes the amino acid
sequence <SEQ ID 7316>:

EYSIIPQLKETIHYIELKLEEAERASLVRIMKITS

Analysis of this protein sequence reveals the following: 

Possible site: 16 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.5026 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAA04277 GB:D17462 Na+-ATPase subunit D [Enterococcus hirae]
Identities = 119/201 (59%), Positives = 151/201 (74%), Gaps = 2/201 (0%)

Query: 10 RIJSnnCPTRMELSNLKNRLKTATRGHKLLKDKRDELMRRFVDIjIRENIffiljRQTIEKEIjAAN 69 

RLNV PTRNEL+ LK +L TATRGHKLLKDK+DELMR+F+ L1R+NNELRQ IEKE 
Sbjct: 2 RIiNVNPTPJIELTRLKKQLTTATRGHKIiLKDKQDELMRQFIIiljIRKNNEIjRQAIEKETQTA 61 

Query: 70 MKEFVIAKASENSLMVEELFAVPvHEVrLWIDIENIMSVOTPKFHVQSNTAREQEQGEFA 129 

MK+FVLAK++ ++EL A+P V++ + +NIMSV VP + Q + + E 

Sbjct: 62 MKDEVLAKSTVEFAFIDELLALPAEIWSISVVEKKIMSV^^ 119 

Query: 130 YSyLSSNSEMDlWIQKTKELLEKLLRlAEVEKTCQ^lADDIEKTRRRVNGljEYSII^QLK 189 

Y YL SN+E+D +1 +LL KLI1+LAEVEKTCQI1MA++ IEKTRRRVN LEY IPQL+ 
Sbjct: 120 YGYLHSNAELDRS IDGFTQLLPKLLKiAE V£KTCQL[vlAE3 1 EKTRRR VNAljEYMTT PQLE 179

Sbjct: 180 ETIYYIKMKLEENERAEVTRL 200

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2409

A DNA sequence (GASx161R) was identified in S.pyogenes <SEQ ID 7317> which encodes the amino acid
sequence <SEQ ID 7318>. Analysis of this protein sequence reveals the following:

Possible site: 27 

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2410 

A DNA sequence (GASx164) was identified in S.pyogenes <SEQ ID 7319> which encodes the amino acid
sequence <SEQ ID 7320>. Analysis of this protein sequence reveals the following: 

Possible site: 36

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -1.06 Transmembrane 9 - 25 ( 9 - 25)

Final Results

bacterial membrane --- Certainty=0.1426 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

A related sequence was also identified <SEQ ID 9091> which encodes the amino acid sequence <SEQ ID
9092>. Analysis of this protein sequence reveals the following:

Possible cleavage site: 33

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty= 0.300 (Affirmative) < succ>

bacterial membrane --- Certainty= 0.000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty= 0.000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2411

A DNA sequence (GASx165) was identified in S.pyogenes <SEQ ID 7321> which encodes the amino acid
sequence <SEQ ID 7322>. Analysis of this protein sequence reveals the following: 

Possible site: 59

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2251 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 
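
Every example opens with the same sentence pattern naming the DNA sequence, the organism, and the two SEQ ID numbers. A minimal sketch of extracting those fields is given below, assuming OCR-damaged characters have already been corrected; the regular expression and the `parse_header` helper are illustrative assumptions rather than anything defined in this document.

```python
import re

# Pattern for the opening sentence of an example, after whitespace is flattened.
HEADER_RE = re.compile(
    r"DNA sequence \((?P<name>[^)]+)\) was identified in (?P<species>S\.\s?\w+) "
    r"<SEQ ID (?P<dna>\d+)> which encodes the amino acid sequence "
    r"<SEQ ID (?P<protein>\d+)>"
)


def parse_header(text):
    """Return the sequence name, species and SEQ ID numbers, or None."""
    flat = " ".join(text.split())  # join wrapped lines into one
    m = HEADER_RE.search(flat)
    if m is None:
        return None
    return {
        "name": m["name"],
        "species": m["species"],
        "dna_seq_id": int(m["dna"]),
        "protein_seq_id": int(m["protein"]),
    }


header = """A DNA sequence (GASx165) was identified in S.pyogenes <SEQ ID 7321> which encodes the amino acid
sequence <SEQ ID 7322>. Analysis of this protein sequence reveals the following:"""
record = parse_header(header)
print(record)
```

In the examples of this section the protein SEQ ID is the DNA SEQ ID plus one, which gives a simple consistency check on extracted records.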

Example 2412 

A DNA sequence (GASx166) was identified in S.pyogenes <SEQ ID 7323> which encodes the amino acid
sequence <SEQ ID 7324>. Analysis of this protein sequence reveals the following: 

Possible site: 34

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2413 

A DNA sequence (GASx167) was identified in S.pyogenes <SEQ ID 7325> which encodes the amino acid
sequence <SEQ ID 7326>. Analysis of this protein sequence reveals the following: 

Possible site: 31

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2414

A DNA sequence (GASx168R) was identified in S.pyogenes <SEQ ID 7327> which encodes the amino acid
sequence <SEQ ID 7328>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2415 

A DNA sequence (GASx169R) was identified in S.pyogenes <SEQ ID 7329> which encodes the amino acid
sequence <SEQ ID 7330>. Analysis of this protein sequence reveals the following: 

Possible site: 31 

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2416 

A DNA sequence (GASx170) was identified in S.pyogenes <SEQ ID 7331> which encodes the amino acid
sequence <SEQ ID 7332>. Analysis of this protein sequence reveals the following: 

Possible site: 61 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -2.34 Transmembrane 154 - 170 ( 153 - 170)

INTEGRAL Likelihood = -1.12 Transmembrane 20 - 36 ( 19 - 36)

INTEGRAL Likelihood = -0.69 Transmembrane 52 - 68 ( 52 - 68)

INTEGRAL Likelihood = -0.53 Transmembrane 399 - 415 ( 399 - 415)

Final Results

bacterial membrane --- Certainty=0.1935 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:BAB05347 GB:AP001512 cystathionine beta-lyase [Bacillus halodurans]
Identities = 200/384 (52%), Positives = 262/384 (68%), Gaps = 3/384 (0%)

Query: 79 IAEVYEMRENTTLLHGYTVIDEFTGAASVPIYQTSTFHNSELYCPSQKHLYTRFSNPTTE 138

++E Y ++ T LLH +D+ TGA SVPI STFH + + + Y+R NPT + 

Sbjct: 1 MSEQYSLQ- -TKLLHKEHKVDQATGAVSVPIQHASTFHQFD- FDTFGTYDYSRSGNPTRD 57 

Query: 139 ALEDGLACLEKATYAVAYASGMRRISTVLMLLKAGDHVIFPLEVYGGTCQFATAILPNYQ 198 
ALE +A LE + A+ASGMAAIST MLL GDHV+ +VYGGT + T +L

Query: 199 IETSFVDMADLATVKASIRPISrrRMIYLETPSNPLLKICDISELVQIAI<AYGVLTVADNTF 258 
IE 4-FVDM +LA V A+IRPNTR++Y+ETPSNP L I DI +V LAK + LT DNTF 
Sbjct: 118 IEHTFVDMTNLAEVAAAIRPOTRVLYMETPSNPTLMITDIRGWSIAKEHECLTFLDNTF 177

Query: 259 MTSLYQEPIiAMGVDIWESVTKFINGHSDWAGLAATIINFAIYNQLKLFQKNFGAIVGVE 318 

+T Q PL +GVD+V+ S TKFI GHSDWAGLA T NE + +L Q +FGAI+GV+ 
Sbjct: 178 LTPALQRPLELGVDWLHSATKFIGGHSDWAGLAVTKNEELGKKLAFLQNSFGAILGVQ 237 

Query: 319 DAWLILRGMKTMGIRMEQAVKNAQQLAOTIiAKHPK^»mYPGLDSHPlfflDTHLQQAKNG 378 

D WL+LRG+KT+ +RME K AQQ+A +L P+V +V+YPGL HP H+ +QA+ 
Sbjct: 238 DVWLVLRGLKTLHVRMEHGEKGAQQIAEWLQGVPEVKRVYYPGLKDHPGHELQKRQAEGF 297 

Query: 379 GAVLSFELASKEELMTFTHRIQLPILAVSLGGVESILSHPATMSHACLSPQARLEQGWD 438

GAVLSFEL ++E + F ++LP+ AVSLG VESILS+PA MSHA + + R +G+ D 
Sbjct: 298 GAVLSFELFmFATORFVEHVKLPVFAVSLGAVESILSYPAKMSHAAMPKEEREARGIRD 357 

Query: 439 GLLRLSCGVENIEDLLADFEQALA 462 
GLLRLS G+E E+L+ADF+ A A

Sbjct: 358 GIiLRLSVGLEKPEELMADFKAAFA 381 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
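
The homology summaries above report identities, positives and gaps as counts over the alignment length, followed by a percentage. As a minimal sketch, assuming the summary line has been cleaned of OCR spacing, the counts can be extracted and the ratios recomputed as below; `parse_summary` and `percent` are hypothetical helper names.

```python
import re

# Matches "Identities = 200/384", "Positives = 262/384", "Gaps = 3/384".
SUMMARY_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)")


def parse_summary(line):
    """Map each statistic name to its (count, alignment_length) pair."""
    return {
        name: (int(num), int(den))
        for name, num, den in SUMMARY_RE.findall(line)
    }


def percent(num, den):
    """Ratio expressed as a percentage, without rounding."""
    return 100.0 * num / den


summary = "Identities = 200/384 (52%), Positives = 262/384 (68%), Gaps = 3/384 (0%)"
stats = parse_summary(summary)
for name, (num, den) in stats.items():
    print(f"{name}: {num}/{den} = {percent(num, den):.1f}%")
```

Recomputing the ratio is mainly useful as a sanity check on digitized text, since a mismatch between the counts and the printed percentage can flag a misread digit.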

Example 2417

A DNA sequence (GASx178) was identified in S.pyogenes <SEQ ID 7333> which encodes the amino acid
sequence <SEQ ID 7334>. Analysis of this protein sequence reveals the following: 

Possible site: 21 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1492 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2418 

A DNA sequence (GASx182) was identified in S.pyogenes <SEQ ID 7335> which encodes the amino acid
sequence <SEQ ID 7336>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2584 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2419 

A DNA sequence (GASx187) was identified in S.pyogenes <SEQ ID 7337> which encodes the amino acid
sequence <SEQ ID 7338>. Analysis of this protein sequence reveals the following:

Possible site: 61

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2084 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2420 

A DNA sequence (GASx188) was identified in S.pyogenes <SEQ ID 7339> which encodes the amino acid
sequence <SEQ ID 7340>. Analysis of this protein sequence reveals the following:

Possible site: 34

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2060 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAG05515 GB:AE004640 conserved hypothetical protein [Pseudomonas aeruginosa]
Identities = 140/442 (31%), Positives = 208/442 (46%), Gaps = 73/442 (16%)

Query: 2 KKYI^QNVYDALIERLHFLEiroFPIVyiSFSGGKDSGLLIMILLDFRDKXYPDREIG--- 58 
K Y + +V+ A + RL +F +F V ++FSGGKDS + L + LD RE+G

Sbjct: 4 KHYQDADVHARTLSRLRLVFRNFERVCVAFSGGKBSSVTLQLALDVA RELGRSP 57 

Query: 59 --VFHQDFEAQYSLTTKWQETFTSLEXBKKKVSLYWCliPMATRTALSSYEMFWYPWDDK 116 
V DEQYT+VE GRV +WCLP+ R A S E +W W+ 

Sbjct: 58 VDVLFIDLEGQYQATIDHVSEML GRPDVRPWWVCLPLNLRNASSLEEPYWCCWEPG 113

Query: 117 TEDIl^PMPSQDOTINLENNSITTYRYKIWQEDLftKQFGRWYKQIHGNQICrVCILGNRA 176

Sbjct: 114 AEADWWPLPKQRGVIS-DPAFFPFYRYRMEFEEFVAGFNAWLAR--

Query: 177 SESLHRYSGFINKKYGYQKEC WITKQFKDVWTAS- -PLYDWSVEDIWH 222 

ESL+RY K+ K+C W + + S P+YDW ED+W 

Sbjct: 170 DESIiNRYLaV--KRRSRA.KQCAWTPPGGSAPLAWSARDRANPQAVSFFPIYDWRFEDLWR 227 

Query: 223 AYyKFSYSYNELYDLFYKAGLKPSQMRVASPFQDYAVDSIjNLYRIIDQETWVICLLGRVQG 282 

Y+YN LYD Y+AG+ SQMR+ P+ D L+L+ 1+ TW K++ RV G 

Sbjct: 228 CVADHGYAYNRLYDQMYRAGVPFSQMRICQPYGDDQRKGLDLFHRIEPRTWFKVVRRVAG 287 

Query: 283 VWFSNIYGRTKaMGYK-SIALPKGH-SWKSYTQFLLSTLPWLRNNYVRKFNKSIDFWHK 340 

M+ Y R + +GY+ + LP +W+ Y+QFLL 4+P LR Y R+ + I +W + 
Sbjct: 288 ANYGARYCRQRFLGYRGGLGLPPSFGTWREYSQFLLRSMPPPLRGIYQRRIERFILWWKQ 347 

Query: 341 TGGGLAEETINELIEKGYRIARNGISNYTSFKHSRVIFLDQ-IPDDTDDIVTTKDIPSWK 399 

LA 1+ D IP + + PSW+ 

Sbjct: 348 HDYPLA IWPDAGIP ALENRRKQPSWR 373 

Query: 400 RMCFCILKNDHICRTMGFGLTR 421 

R+ +LK D + R++ FG ++ 
Sbjct: 374 RIALSLLKQD-MARSLSFGFSQ 394 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2421 

A DNA sequence (GASx189) was identified in S.pyogenes <SEQ ID 7341> which encodes the amino acid
sequence <SEQ ID 7342>. Analysis of this protein sequence reveals the following: 
Possible site: 45 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.4121 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC73702 GB:AE000165 orf, hypothetical protein [Escherichia coli]
Identities = 79/162 (48%), Positives = 110/162 (67%), Gaps = 1/162 (0%)

Query: 7 PVYEIKS1PIEKISPNDYNPNSVAPPEKKLLYDSIKSDGYTMPIVCYYDKEEDRYSIVDG 66 

PV + + ++ PNDYNPN+VAPPE KLL SI+ DG+T PIV + +++ IVDG 
Sbjct: 46 PVDCVLWKNSQLMPM7YNPNIWAPPEKK1LQKSIEIDGFTQPIW 104 

Query: 67 FHRYRIMLDYSDIYERESGRLPVSVIDKSLDYRMASTIRHNRARGSHDVDLMSQIVKDLH 126 

FHR+ I S + R G LPV+ ++ + + R+A+TIRHNRARG H + MS+IV++L 
Sbjct: 105 FHRHEIGKGSSSLKLRLKGYLPVTCLEGTRNQRIAATIRHNRARGRHQITAMSEIVRELS 164 

Query: 127 ECGRSDNWIAKHLGMDKDEILRLKQITGLASLFKDHEFNQSW 168 

+ G DN I K LGMD DE+LRLKQI GI> LF D +++++W 
Sbjct: 165 QLGWDDNKIGKELGMDSDEVLRLKQINC-LQ2LFADRQYSRAW 206 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2422

A repeated DNA sequence (GASx192R) was identified in S.pyogenes <SEQ ID 7343> which encodes the
amino acid sequence <SEQ ID 7344>. Analysis of this protein sequence reveals the following:

Possible site: 13

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4301 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAA63509 GB:X92946 transposase [Lactococcus lactis]
Identities = 23/36 (63%), Positives = 28/36 (76%)

Query: 1 MQDKLVTEAFNQAYNREKPKEGVIVHTDQGSQYTGA 36
MQDKLV + F QA +E P+ G+IVHTDQGSQYT +
Sbjct: 134 MQDKLVRDCFLQACGKEHPQPGLIVHTDQGSQYTSS 169

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
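
By way of illustration only, the percentages quoted in the GENPEPT summary lines can be reproduced from the raw counts. The Python sketch below assumes that each percentage is the match count divided by the alignment length, truncated to a whole number. That rounding rule is an assumption inferred from figures such as 23/36 (63%) above and 79/162 (48%) in Example 2421; it is not stated in this specification, and the function name `summary_percent` is hypothetical.

```python
def summary_percent(matches: int, aligned_length: int) -> int:
    """Return matches/aligned_length as a whole-number percentage.

    Assumption: the value is truncated toward zero rather than rounded.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return (100 * matches) // aligned_length


# Counts copied from the summary lines of Examples 2422 and 2421.
identities_2422 = summary_percent(23, 36)
identities_2421 = summary_percent(79, 162)
gaps_2421 = summary_percent(1, 162)
print(identities_2422, identities_2421, gaps_2421)
```

Integer arithmetic is used so that the result does not depend on floating-point rounding.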

Example 2423 

A DNA sequence (GASx194R) was identified in S.pyogenes <SEQ ID 7345> which encodes the amino acid
sequence <SEQ ID 7346>. Analysis of this protein sequence reveals the following:

Possible site: 25

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:CAA63508 GB:X92946 hypothetical protein [Lactococcus lactis]
Identities = 64/96 (66%), Positives = 78/96 (80%)

Query: 1 MPRKTFDKAFKLSAVKLILEEEQSVKMVSSTLEIHPNSLYQWIQEYEKYGESAFPGHGSA 60
M R+ FDK FK SAVKLILEE SVK VS LE+H NSLY+W+QE E+YGESAFPG+G+A
Sbjct: 1 MARRKFDKQFKNSAVKLILEEGYSVKEVSQELEVHANSLYRWVQEVEEYGESAFPGNGTA 60

Query: 61 LRHAQFETICKLEKEHKLLQEELALLKKFQVFLKPNR 96
L +AQ + K LEKE++ LQEEL LLKKF+VFLK ++
Sbjct: 61 LANAQHKI KLLEKENRYLQEELELLKKFRVFLKRSK 96

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2424

A DNA sequence (GASx195R) was identified in S.pyogenes <SEQ ID 7347> which encodes the amino acid
sequence <SEQ ID 7348>. Analysis of this protein sequence reveals the following:

Possible site: 13

>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood / Transmembrane entries (likelihood values illegible in the source); legible residue ranges: 260 - 276, 344 - 360, 173 - 2011 (as printed), 410 - 435, 256 - 278

Final Results

bacterial membrane --- Certainty=0.5522 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB75191 GB:AL139075 putative integral membrane protein
[Campylobacter jejuni]
Identities = 177/430 (41%), Positives = 274/430 (63%), Gaps = 8/430 (1%)

Query: 5 IIISAIALAIGIGYRTKINIGLLAIAFSYLIATTLMGLSPKELLHFWPTSLFFTIFSVSL 64
+IIS+I +AI +GY T+ N+G+ A+ F+Y+I M L+PK+++ FWP S+FF IF+VSL
Sbjct: 6 LIISSIIVAIILGYITRHNVGIFAMIFAYIIGAFFMDLAPKKIIAFWPISIFFVIFAVSL 65

Query: 65 FYNVATTNGTLDVLAQHILYRTRTHPNALYMILYLIATLLSALGAGFFTTMAVCCPLAIT 124
FYN AT NGTL+ LA H++YR HP L ++++++ +++ALGAGF+T +A PL
Sbjct: 66 FYNFATWGTLEKLAGHLMYRFANHPYLLPFVIFWSAIIAALGAGFYTVLAFMAPLTFL 125

Query: 125 LCQKADKHPLIGAQAVNWGASGGANLITSGSGIVFQGLFKQMGWE-EQAFSLGNHIFIVS 183 

LC K + GA A+N+GA GGAN ITS SGI+F+GL + G E +AF+ + IF + 

Sbjct: 126 LCDKIGLSKIAGAMAINYGALGGANFITSQSGIIFRGLMENSGIEANEAFANSSIIFAFT 185 

Query: 184 IIYPLIVLLLLSCYIRYSKGRTNSSLT-IDQPPVLSKVQRQTTLLMISSMVLVWLFPLLL 242 

II P++VL + ++ + N ++ I +P Q+ T +LM +V+V +FP+L 

Sbjct: 185 IILPIWL SFFVFNAFKNNIKISVISKPDPFDYKQKTTLILMFMMIVWLIFPVLN 241 

Query: 243 LIFPNIAWIATYRQTFDIGFVSILWCLALRLKLGKQEAILAKVPWAIIIMLCGMSLLMS 302 

+IFP+ 1+ + + DI ++++ V +AL LKL ++ ++A +PW +IM+CG+ +L+S 
Sbjct: 242 IIFPHNETISYFNKKIDIAMIAMIFVAIALFLKLADEKQWALIPWGTLIMICGVGMLIS 301 

Query: 303 LAVKSGLVTLIGHLITTTIPHFWLPLFFCTIAGVMSLFSSTLSWAPTLFPIIATISAQS 362 

+AV++G + L L+ I ++PL C IA MSLFSSTL W P LFPI+ +I+A S 
Sbjct: 302 IAVF^GAIKLFSDLVENEINVIFIPLIMCAIAAFMSLFSSTLGWTPALFPIVPSIAASS 3 61 

Query: 363 PHIDIRLLTTATIIGALSTNISPFSSAGSLIQLSLPHIEERSLAFKKQILLGVPISLSLA 422 

+ LL + +4GA ++ ISPFSS GSLI S P + L FK ++ VPI A 
Sbjct: 362 -GLSEALLFSCIWGAQASAISPFSSGGSLILGSCPDKYKEKL-FKDLLIKAVPIGFIAA 419 

Query: 423 LLTIWILMLL 432 

+L 1+ + 
Sbjct: 420 ILATIIMSFI 429 



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2425

A DNA sequence (GASx196) was identified in S.pyogenes <SEQ ID 7349> which encodes the amino acid
sequence <SEQ ID 7350>. Analysis of this protein sequence reveals the following:

Possible site: 57

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0563 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAC45128 GB:U65510 nicotinate-nucleotide pyrophosphorylase
[Rhodospirillum rubrum]
Identities = 116/277 (41%), Positives = 170/277 (60%), Gaps = 4/277 (1%)

Query: 17 iDTLICAALREDV-HSEDYSTNAIFDKHGQAKVSLFAKEAGVLAGLTVFQRVFTIj 75
)+ ++ AL ED+ + D ++ A +A A++ G+LAGL + F L
Sbjct: 10

Query: 76
- +G V E+ G+ R++L ER ALNFL HLSGIA+ T
Sbjct: 70

Query: 136
TRKTTP BR EXYAVR GGG NHRF h DA+++KDNHIA G V A
Sbjct: 128

Query: 196
+ +E+EV++L AE A G + +++LDNM + +A+ +4-AGR E
Sbjct: 188 iVGHfWRIEIEVDTLEQIiAEVIAVGGAEVVLLDNMDAPTLTRAVDWAGRLVTE 247

Query: 255 ITTISRFRGLAIDYVSSGSLTHSAKSLDFSM 291
TI+ +DY+S G+LTHS +LD +
Sbjct: 248 iDTIAALAESGVDYISVGALTHSVTTLDIGL 284

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2426 

A DNA sequence (GASx199) was identified in S.pyogenes <SEQ ID 7351> which encodes the amino acid
sequence <SEQ ID 7352>. Analysis of this protein sequence reveals the following:

Possible site: 25

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1649 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2427 

A DNA sequence (GASx201) was identified in S.pyogenes <SEQ ID 7353> which encodes the amino acid 
sequence <SEQ ID 7354>. Analysis of this protein sequence reveals the following: 

Possible site: 19 

>>> Seems to have an uncleavable N-term signal seq 

Final Results 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2428 

A DNA sequence (GASx203) was identified in S.pyogenes <SEQ ID 7355> which encodes the amino acid 
sequence <SEQ ID 7356>. Analysis of this protein sequence reveals the following: 
Possible site: 37

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
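
Purely as an illustrative sketch, the three "Final Results" lines of a localisation report such as the one in this Example can be read programmatically to pick out the location with the highest certainty. The regular expression and the function name `best_location` below are assumptions made for this sketch and are not part of the analysis software used for these Examples; the sample text is a cleaned transcription of the lines above.

```python
import re
from typing import Dict, Tuple

# Matches e.g. "bacterial outside --- Certainty=0.3000 (Affirmative)".
# The dashes and the spaces around the value are optional.
RESULT_LINE = re.compile(
    r"bacterial\s+(?P<location>\w+)\s*-*\s*Certainty=\s*(?P<certainty>\d+\.\d+)"
)


def best_location(report: str) -> Tuple[str, float]:
    """Return the (location, certainty) pair with the highest certainty."""
    scores: Dict[str, float] = {
        m["location"]: float(m["certainty"]) for m in RESULT_LINE.finditer(report)
    }
    if not scores:
        raise ValueError("no 'Final Results' lines found")
    location = max(scores, key=scores.get)
    return location, scores[location]


example_2428 = """
bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>
"""
print(best_location(example_2428))
```

Where all three certainties are 0.0000, as in Example 2427, the function simply returns the first location listed, so a caller may wish to treat that case as "Not Clear".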

Example 2429 

A DNA sequence (GASx210) was identified in S.pyogenes <SEQ ID 7357> which encodes the amino acid 
sequence <SEQ ID 7358>. Analysis of this protein sequence reveals the following: 

Possible site: 29 

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2430 

A DNA sequence (GASx211) was identified in S.pyogenes <SEQ ID 7359> which encodes the amino acid
sequence <SEQ ID 7360>. Analysis of this protein sequence reveals the following:

Possible site: 24

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2431 

A DNA sequence (GASx213) was identified in S.pyogenes <SEQ ID 7361> which encodes the amino acid
sequence <SEQ ID 7362>. Analysis of this protein sequence reveals the following:

Possible site: 14

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4430 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2432

A DNA sequence (GASx219) was identified in S.pyogenes <SEQ ID 7363> which encodes the amino acid 
sequence <SEQ ID 7364>. Analysis of this protein sequence reveals the following: 

Possible site: 15 

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2433 

A DNA sequence (GASx220) was identified in S.pyogenes <SEQ ID 7365> which encodes the amino acid
sequence <SEQ ID 7366>. Analysis of this protein sequence reveals the following:

Possible site: 24

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0530 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2434

A DNA sequence (GASx231R) was identified in S.pyogenes <SEQ ID 7367> which encodes the amino acid
sequence <SEQ ID 7368>. Analysis of this protein sequence reveals the following:

Possible site: 30

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2435 

A DNA sequence (GASx237) was identified in S.pyogenes <SEQ ID 7369> which encodes the amino acid 
sequence <SEQ ID 7370>. Analysis of this protein sequence reveals the following: 
Possible site: 52

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4961 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:CAB49143 GB:AJ248283 hypothetical protein [Pyrococcus abyssi]
Identities = 79/229 (34%), Positives = 131/229 (56%), Gaps = 11/229 (4%)

Query: 18 NMQFPLVEIDLEHGGSWLQQI^MVra^^ 77
F L+E++L G +V + G+MVY V++ TK G L+GA+ R
Sbjct:

Query: 78
G G++ AP PG I++LEL
Sbjct: 55

Query: 138
C +FG +G +F++ +G G + +S+G+1+KITL G -f
Sbjct: 112

Query: 198
4+ +GEG+V F GHG++YIQ+ +L+ F + +BP S
Sbjct: 170 LKATLFSGEGLVFEFSGHGDVYIQTRSLDGFLSWILPHLPKS 218

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2436 

A DNA sequence (GASx240R) was identified in S.pyogenes <SEQ ID 7371> which encodes the amino acid 
sequence <SEQ ID 7372>. Analysis of this protein sequence reveals the following: 

Possible site: 35 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2745 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2437 

A DNA sequence (GASx241) was identified in S.pyogenes <SEQ ID 7373> which encodes the amino acid 
sequence <SEQ ID 7374>. Analysis of this protein sequence reveals the following: 

Possible site: 21 



>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -10.14 Transmembrane 196 - 212 ( 187 - 215)
INTEGRAL Likelihood = -8.01 Transmembrane 160 - 176 ( 156 - 179)
INTEGRAL Likelihood = -5.89 Transmembrane 116 - 132 ( [illegible] - 134)
INTEGRAL Likelihood = [illegible] Transmembrane 74 - 90 ( 73 - 97)
INTEGRAL Likelihood = -2.66 Transmembrane 51 - [illegible] ( 50 - 68)
INTEGRAL Likelihood = -2.60 Transmembrane 8 - 24 ( 7 - [illegible])
INTEGRAL Likelihood = -1.28 Transmembrane 344 - 360 ( 344 - 360)
INTEGRAL Likelihood = -0.22 Transmembrane 30 - 46 ( 30 - 46)

Final Results

bacterial membrane --- Certainty=0.5055 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: FLLLSIIVYYMTKIYIFSFLSDITLP- - -VWKQLTI-IALALFFNQFPYLS PLLI 58
++LL +V + KI IF + I+L ++K + LA+ F Y+ +
Sbjct: WILLYTLVTHGLKIVIFFKVDGISLTFERIFKAFLFKILLAWFGMLGYMVGNVYLSYFM 64

Query: DPL LFLWLRQETKQLFSLKALFIAVAPSVLVDLLSRFMGTIVIPYLFLSSGIYLG 114
+PL L ++LR+ K+L LF + P +LV+L R + V+P FL G
Sbjct: EPLYGIGLSFLLLRELPKKLL LFYGLFPMILVNLFYRGVSYFVLP- -FLGQGQVYD 118

Query:
F L ++IF F
Sbjct: 119

Query: 174
K+L L +EQ
Sbjct: 178

Query: 234
+E Y +H+E+LY++VR+F+ DY + + L 1+ E + QI++1Y
Sbjct: 234

Query: 294
++KSLL+ K I A I NVEVP+ 1+ + LD L ++SI CD
Sbjct: 294

Query: 354
NAIEA++EA QPH+SIA+F G + F++ N+ K++ +DI++1F G SSKG ERG+GL
Sbjct: 354

Query: 413
+I++ +P SL T D F Q LT+
Sbjct: 414 'MKIVESHPITTSLNTTCQDHVFRQVLTV 442

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2438 

A DNA sequence (GASx242R) was identified in S.pyogenes <SEQ ID 7375> which encodes the amino acid 
sequence <SEQ ID 7376>. Analysis of this protein sequence reveals the following: 

Possible site: 26 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4165 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2439 

A DNA sequence (GASx243) was identified in S.pyogenes <SEQ ID 7377> which encodes the amino acid 
sequence <SEQ ID 7378>. Analysis of this protein sequence reveals the following:

Possible site: 26

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = [illegible] Transmembrane 188 - 204 ( 182 - 208)
INTEGRAL Likelihood = [illegible] Transmembrane 52 - 68 ( 47 - 69)
INTEGRAL Likelihood = [illegible] Transmembrane 119 - 135 ( 114 - 142)
INTEGRAL Likelihood = [illegible] Transmembrane 83 - 99 ( 77 - 107)
INTEGRAL Likelihood = [illegible] Transmembrane 328 - 344 ( 328 - 345)
INTEGRAL Likelihood = [illegible] Transmembrane 7 - 23 ( 6 - 23)
INTEGRAL Likelihood = [illegible] Transmembrane 35 - 51 ( 35 - 51)

Final Results

bacterial membrane --- Certainty=0.5437 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAC10175 GB:AJ278302 histidine kinase [Streptococcus pneumoniae] 
Identities = 123/438 (28%), Positives = 229/438 (52%), Gaps = 49/438 (11%) 





Query: 20
Sbjct: 19

Query: 67
Sbjct: 79

Query: 127
Sbjct: 132

Query: 134 - -SLIRFISEDKMKKWVFWMNTAMFSYYFFAHFLVTVQSGFLALYF 183
SL + I + 4K + +N M +YY L YF
Sbjct: 133 FTSLRKGILDKDFQKSLTQINWIMGAYYLVIQNLS YF 182

Query: 235
Sbjct: 243

Query: 295
Sbjct: 303

v +HD N+L SL+ 1+ D++ I +YD+V++ S+

++S++ K

Query: 355 RRPFLS IAYFKQDNKQLF 1 1 SNSTKTNRVD I AKRFDAQQCNSAH FLTVLDSY 406
+P +SIA+FK ++ FIIENS K +DI++ F + + ++4-S4-
Sbjct: 363 CQPHVSIAFFKNGAQETFIIHNSIKEEGIDISEIFSFGASSKGEERGVGLYTVMKIVESH 422

Query: 407
Sbjct: 423



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2440

A DNA sequence (GASx248) was identified in S.pyogenes <SEQ ID 7379> which encodes the amino acid 
sequence <SEQ ID 7380>. Analysis of this protein sequence reveals the following: 

Possible site: 32 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.5665 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2441 

A DNA sequence (GASx255) was identified in S.pyogenes <SEQ ID 7381> which encodes the amino acid
sequence <SEQ ID 7382>. Analysis of this protein sequence reveals the following:

Possible site: 19

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1437 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2442 

A DNA sequence (GASx270R) was identified in S.pyogenes <SEQ ID 7383> which encodes the amino acid
sequence <SEQ ID 7384>. Analysis of this protein sequence reveals the following:

Possible site: 21

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -5.99 Transmembrane 20 - 36 ( 17 - 36)

Final Results

bacterial membrane --- Certainty=0.3357 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 
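
For illustration only, an "INTEGRAL Likelihood ... Transmembrane" line of the kind reported in this Example can be split into its numeric fields as sketched below in Python. The field names, the `Segment` type and the reading of the parenthesised pair as a wider flanking span are assumptions made for this sketch; they are not definitions taken from the analysis software.

```python
import re
from typing import NamedTuple


class Segment(NamedTuple):
    likelihood: float
    start: int       # first residue of the reported segment
    end: int         # last residue of the reported segment
    span_start: int  # parenthesised range (assumed to be a wider span)
    span_end: int

    @property
    def length(self) -> int:
        """Number of residues in the reported segment, inclusive."""
        return self.end - self.start + 1


SEGMENT_LINE = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?\d+\.\d+)\s+Transmembrane\s+"
    r"(\d+)\s*-\s*(\d+)\s*\(\s*(\d+)\s*-\s*(\d+)\s*\)"
)


def parse_segment(line: str) -> Segment:
    """Parse one INTEGRAL line; raise ValueError if it does not match."""
    match = SEGMENT_LINE.search(line)
    if match is None:
        raise ValueError(f"not an INTEGRAL line: {line!r}")
    likelihood, start, end, span_start, span_end = match.groups()
    return Segment(float(likelihood), int(start), int(end),
                   int(span_start), int(span_end))


segment = parse_segment(
    "INTEGRAL Likelihood = -5.99 Transmembrane 20 - 36 ( 17 - 36)"
)
print(segment.likelihood, segment.length)
```

Lines whose likelihood or residue numbers were lost in reproduction will not match the pattern and raise `ValueError`, which keeps incomplete entries from being silently misread.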

Example 2443 

A DNA sequence (GASx272) was identified in S.pyogenes <SEQ ID 7385> which encodes the amino acid 
sequence <SEQ ID 7386>. Analysis of this protein sequence reveals the following: 

Possible site: 58 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2488 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MSRKNQAPKREVLPDPLYNSKIVTRLINRVMLDGKRGTAATIVYDAFNAIKEATGNDALE 60

M RK KR+VLPDP+YNSK+V+RLIN++M+DGK+G TI+Y +F+ IKE TGNDA+E
Sbjct: 1 MPRKGPVAKRDVLPDPIYNSKLVSRLINKMMIDGKKGKPQTILYKSFDIIKERTGNDAME 60

Query: 61 VFETAMDNIMPVLEVRARRVGGSNYQVPVEVRPERRTTLGLRWLVNASRARGEHTMKDRL 120

VFE A+ NIMPVLEV+ARRVGG+NyQVPVEVRPERRTTLGLRWLVN +R RGE TM++RL 
Sbjct: 61 VFEC^^KNIMPVLETOARRVGGANYQVPvEWPERRTTLGIjRV&VNYARIjRGEKTMEERL 120 

Query: 121 AKEIMDAANNTGASVKKREDTHKMAEANRAFAHFRW 156

A EI+DAANNTGA+VKKREDTHKMAEAN+AFAH+RW
Sbjct: 121 ANEILDAANNTGAAVKKREDTHKMAEANKAFAHYRW 156

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2444 

A DNA sequence (GASx274) was identified in S.pyogenes <SEQ ID 7387> which encodes the amino acid 
sequence <SEQ ID 7388>. Analysis of this protein sequence reveals the following: 

Possible site: 61 

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

A related sequence was also identified in GAS <SEQ ID 9095> which encodes the amino acid sequence 
<SEQ ID 9096>. Analysis of this protein sequence reveals the following: 

Possible cleavage site: 52

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty= 0.291 (Affirmative) < succ>
bacterial membrane --- Certainty= 0.000 (Not Clear) < succ>
bacterial outside --- Certainty= 0.000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2445

A DNA sequence (GASx275R) was identified in S.pyogenes <SEQ ID 7389> which encodes the amino acid
sequence <SEQ ID 7390>. Analysis of this protein sequence reveals the following:

Possible site: 16

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.5664 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2446 

A DNA sequence (GASx283) was identified in S.pyogenes <SEQ ID 7391> which encodes the amino acid
sequence <SEQ ID 7392>. Analysis of this protein sequence reveals the following:

Possible site: 18

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0724 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2447 

A DNA sequence (GASx298) was identified in S.pyogenes <SEQ ID 7393> which encodes the amino acid 
sequence <SEQ ID 7394>. Analysis of this protein sequence reveals the following: 

Possible site: 25

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2840 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2448 

A DNA sequence (GASx300) was identified in S.pyogenes <SEQ ID 7395> which encodes the amino acid 
sequence <SEQ ID 7396>. Analysis of this protein sequence reveals the following: 

Possible site: 18

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -1.91 Transmembrane 4 - 20 ( 4 - 20)

Final Results

bacterial membrane --- Certainty=0.1765 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2449 

A DNA sequence (GASx301) was identified in S.pyogenes <SEQ ID 7397> which encodes the amino acid 
sequence <SEQ ID 7398>. Analysis of this protein sequence reveals the following:

Possible site: 33

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4884 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2450 

A repeated DNA sequence (GASx302) was identified in S.pyogenes <SEQ ID 7399> which encodes the
amino acid sequence <SEQ ID 7400>. Analysis of this protein sequence reveals the following:

Possible site: 22

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2581 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2451 

A DNA sequence (GASx316) was identified in S.pyogenes <SEQ ID 7401> which encodes the amino acid
sequence <SEQ ID 7402>. Analysis of this protein sequence reveals the following:

Possible site: 18

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -0.80 Transmembrane 23 - 39 ( 22 - 39)

Final Results

bacterial membrane --- Certainty=0.1319 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2452 

A DNA sequence (GASx323R) was identified in S.pyogenes <SEQ ID 7403> which encodes the amino acid 
sequence <SEQ ID 7404>. Analysis of this protein sequence reveals the following: 

Possible site: 28 

30 

>» Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm Certainty=0. 0005 (Affirmative) < suco 

35 bacterial membrane Certainty=0. 0000 (Not Clear) < suco 

bacterial outside Certainty=0. 0000 (Not Clear) < suco 

No corresponding DNA sequence was identified in S. agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2453 

A DNA sequence (GASx334) was identified in S.pyogenes <SEQ ID 7405> which encodes the amino acid 
sequence <SEQ ID 7406>. Analysis of this protein sequence reveals the following: 

Possible site: 17

>>> Seems to have a cleavable N-term signal seq.

Final Results 

bacterial outside --- Certainty=0.3000 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2454 

A DNA sequence (GASx336) was identified in S.pyogenes <SEQ ID 7407> which encodes the amino acid
sequence <SEQ ID 7408>. Analysis of this protein sequence reveals the following:

Possible site: 31

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.3379 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2455 

A DNA sequence (GASx361R) was identified in S.pyogenes <SEQ ID 7409> which encodes the amino acid 
sequence <SEQ ID 7410>. Analysis of this protein sequence reveals the following:

Possible site: 22 

>>> Seems to have no N-terminal signal sequence 

Final Results

bacterial cytoplasm --- Certainty=0.2807 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2456 

A DNA sequence (GASx387) was identified in S.pyogenes <SEQ ID 7411> which encodes the amino acid
sequence <SEQ ID 7412>. Analysis of this protein sequence reveals the following:

Possible site: 16

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2740 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2457 

A DNA sequence (GASx389) was identified in S.pyogenes <SEQ ID 7413> which encodes the amino acid
sequence <SEQ ID 7414>. Analysis of this protein sequence reveals the following: 

Possible site: 21

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0744 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2458

A DNA sequence (GASx392) was identified in S.pyogenes <SEQ ID 7415> which encodes the amino acid 
sequence <SEQ ID 7416>. Analysis of this protein sequence reveals the following: 

Possible site: 2S 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.2162 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2459

A DNA sequence (GASx393R) was identified in S.pyogenes <SEQ ID 7417> which encodes the amino acid
sequence <SEQ ID 7418>. Analysis of this protein sequence reveals the following:

Possible site: 18

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.2520 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2460 

A DNA sequence (GASx395) was identified in S.pyogenes <SEQ ID 7419> which encodes the amino acid 
sequence <SEQ ID 7420>. Analysis of this protein sequence reveals the following: 

Possible site: 16

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.2590 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2461 

A DNA sequence (GASx396) was identified in S.pyogenes <SEQ ID 7421> which encodes the amino acid
sequence <SEQ ID 7422>. Analysis of this protein sequence reveals the following:

Possible site: 41

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane Certainty=0 . 0000 (Not Clear) < suco 

bacterial outside Certainty=0. 0000 (Not Clear) < suco 

bacterial cytoplasm Certainty=0 . 0000 (Not Clear) < suco 

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB13373 GB:Z99111 similar to hypothetical proteins [Bacillus subtilis] 
Identities = 23/88 (26%), Positives = 52/88 (58%)

Query: 4 KQERIGLWyLYYimJARKLSKFGDLYYHSKESRyLIIYINKWDLDTKLEEMRRLKCVKD 63

+ R G+WYL+ + ++ L KFG+++Y SKR +Y+++Y + + ++ ++++ VK 
Sbjct: 2 ENRRQGMVVYLHSLKQSKMLRKFGNVHWSICRL1<Y\'VLYCDMDQIEKTMDKIASYSFVKK 61 

Query: 64 IRPSAFDDIDRQFVGNLHRDETNNHQKG 91 

+ PS + +F L + + +++ G 
Sbjct: 62 VEPSYKPFLKLEFESKLDKAKEYDYKIG 89 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2462 

A DNA sequence (GASx400) was identified in S.pyogenes <SEQ ID 7423> which encodes the amino acid 
sequence <SEQ ID 7424>. Analysis of this protein sequence reveals the following: 

Possible site: 13

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.2010 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2463 

A DNA sequence (GASx401) was identified in S.pyogenes <SEQ ID 7425> which encodes the amino acid 
sequence <SEQ ID 7426>. Analysis of this protein sequence reveals the following:

Possible site: 17

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1176 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2464 

A DNA sequence (GASx402) was identified in S.pyogenes <SEQ ID 7427> which encodes the amino acid
sequence <SEQ ID 7428>. Analysis of this protein sequence reveals the following: 

Possible site: 16

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.2938 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2465 

A DNA sequence (GASx403R) was identified in S.pyogenes <SEQ ID 7429> which encodes the amino acid 
sequence <SEQ ID 7430>. Analysis of this protein sequence reveals the following: 

Possible site: 21

>>> Seems to have a cleavable N-term signal seq.

Final Results 

bacterial outside --- Certainty=0.3000 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2466 

A DNA sequence (GASx406) was identified in S.pyogenes <SEQ ID 7431> which encodes the amino acid
sequence <SEQ ID 7432>. Analysis of this protein sequence reveals the following:

Possible site: 31 

>>> Seems to have an uncleavable N-term signal seq 

INTEGRAL Likelihood =-12.26 Transmembrane 15 - 31 ( 4 - 36)
INTEGRAL Likelihood = -6.64 Transmembrane 96 - 112 ( 94 - 115)

Final Results 

bacterial membrane --- Certainty=0.5904 (Affirmative) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2467

A DNA sequence (GASx408R) was identified in S.pyogenes <SEQ ID 7433> which encodes the amino acid 
sequence <SEQ ID 7434>. Analysis of this protein sequence reveals the following: 

Possible site: 19

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -2.23 Transmembrane 17 - 33 ( 15 - 34) 
INTEGRAL Likelihood = -0.85 Transmembrane 38 - 54 ( 38 - 54) 

Final Results

bacterial membrane --- Certainty=0.1893 (Affirmative) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
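
Purely as an illustration, the "INTEGRAL" lines in listings such as the one above can be reduced to a list of predicted transmembrane segments. The Python sketch below is an assumption-laden example rather than part of the described analysis: the name `parse_transmembrane` is hypothetical, the pattern is inferred from the listings in these Examples, and the bracketed range at the end of each line is ignored.

```python
import re

# Assumed layout, e.g.:
#   INTEGRAL Likelihood = -2.23 Transmembrane 17 - 33 ( 15 - 34)
TM_RE = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?\d+\.\d+)\s+"
    r"Transmembrane\s+(\d+)\s*-\s*(\d+)"
)

def parse_transmembrane(text):
    """Return one dict per INTEGRAL line: likelihood, start, end and length."""
    segments = []
    for match in TM_RE.finditer(text):
        start, end = int(match.group(2)), int(match.group(3))
        segments.append({
            "likelihood": float(match.group(1)),
            "start": start,
            "end": end,
            "length": end - start + 1,  # both end positions are counted
        })
    return segments
```

For the two INTEGRAL lines of the Example above, this yields segments spanning positions 17 to 33 and 38 to 54, each 17 residues long.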

Example 2468 

A DNA sequence (GASx412) was identified in S.pyogenes <SEQ ID 7435> which encodes the amino acid
sequence <SEQ ID 7436>. Analysis of this protein sequence reveals the following: 

Possible site: 13

>>> Seems to have an uncleavable N-term signal seq
INTEGRAL Likelihood = -6.53 Transmembrane 5 - 21 ( 4 - 23)

Final Results 

bacterial membrane --- Certainty=0.3612 (Affirmative) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2469 

A DNA sequence (GASx413) was identified in S.pyogenes <SEQ ID 7437> which encodes the amino acid 
sequence <SEQ ID 7438>. Analysis of this protein sequence reveals the following: 

Possible site: 56

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.3422 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database:

>GP:CAA68903 GB:Y07622 lactate oxidase [Streptococcus iniae]
Identities = 328/392 (83%), Positives = 359/392 (90%), Gaps = 4/392 (1%)

Query: 3 MAQKTVITEETTDFVMDFKTSSAEGNVDFINVFDLEKMAQQVIPKGAFGYIASGAGDTFT 62 

M K+ + TT 4-+FRTSSftEG+vDF+NVFDLEKI'!AQ+VIPKGAFGYIASGaSDTFT 
Sbjct: 1 MENKSEMINATT---IEFKTSSAEGSVDFVNVFDLEKMAQKVIPKGAFGTIASGAGDTFT 57 

Query: 63 LHENIRSFNHKLIVPHSLKGVENPSTEITFDGDYLTSPLILAPVAAHKLRHEQGEVASAK 122 

LHENIRSFNHKLI PH LKGVENPSTEITF GD L SP+ILAPVAAHKLANEQGE+ASAK 
Sbjct: 58 LHENIRSFNHKLI-PHGLKGVENPSTEITFIGDKLASPIILAPVAAHKLAl'IEQGEIASAK 116 

Query: 123 GLKEFGSI'YTTSSYSTTDLPEISAALGGTPHWFQFYYSXDDGINRNIMDRVKAQGCKMV 182 

G+KEFG+IYTTSSYSTTDLPEIS LG +PHWFQFYYSKDDGINR+IMDR+KA+G K+IV 
Sbjct: 117 GVKEFGTIYTTSSYSTTDLPEISQTLGDSPHWFQFYYSKDDGINRHIMDRLKAEGVKSIV 176 

Query: 183 LTAnATVGGNREVDRRNGFVFPVGMPIVQEYLPDGAGKTMDYVYKSAKQALTSKDIEYIA 242 

LT DATVGGNREVD+RNGFVFPVGMPIVQEYLP+GAGKTMDYVYK+ KQAL+ KD+EYIA 
Sbjct: 177 LTVDATVGGIffiEVDKRNGFVFPVGMPIVQEYLPNGAGKTMDYVYKATKQALSPKDVEYIA 236 

Query: 243 TYSGLPVWKGPQCAEDTLRALDAGASGIWVTNHGGRQLDSGPAAFDSLQEVAEAVDQKV 302 

YSGLPVYVKGPQCAED RAL+AGASGIWVTNHGGRQLDGGPAAFDSLQEVAE+VD++V 
Sbjct: 237 QYSGLPVWKGPQCAEDAFRALEAGASGIWVTNHGGRQLDGGPAAFDSLQEVAESVDRRV 296 

Query: 303 PIVFDSGIRRGQHIFKALASGADLVALGRPAIYGIAMGGSIGTRQVFEKLNDELKMVMQL 362 

PIVFDSG+RRGQH+FKALASGADLVALGRP IYGIjAMGGS+GTRQVFEK+NDEIjKMVWQL 
Sbjct: 297 PIVFDSGVRRGQHVFKAIASGADLVALGRPVIYGIAMGGSVGTRQVFEKINDELBMVMQL 356 

Query: 363 AGTQTIQDVKAFNLRHNPYDSS I PFDQNALRL 394 

AGTQTI DVK F LRHNPYDSSIPF ++ 
Sbjct: 357 AGTQTIDDVKHFKLRHNPYDSSIPFSPKCFKI 388 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2470 

A DNA sequence (GASx414) was identified in S.pyogenes <SEQ ID 7439> which encodes the amino acid 
sequence <SEQ ID 7440>. Analysis of this protein sequence reveals the following: 
Possible site: 32

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm Certainty=0. 0682 (Affirmative) < suco 

bacterial membrane Certainty=0 . 0000 (Not Clear) < suco 

bacterial outside Certainty=0. 0000 (Not Clear) < suco 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2471 

A DNA sequence (GASx417R) was identified in S.pyogenes <SEQ ID 7441> which encodes the amino acid 
sequence <SEQ ID 7442>. Analysis of this protein sequence reveals the following: 

Possible site: 34

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1765 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S. agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2472

A DNA sequence (GASx418) was identified in S.pyogenes <SEQ ID 7443> which encodes the amino acid 
sequence <SEQ ID 7444>. Analysis of this protein sequence reveals the following: 

Possible site: 32

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.2532 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2473 

A DNA sequence (GASx419) was identified in S.pyogenes <SEQ ID 7445> which encodes the amino acid 
sequence <SEQ ID 7446>. Analysis of this protein sequence reveals the following: 

Possible site: 28

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.3082 (Affirmative) < suco

bacterial membrane --- Certainty=0.0000 (Not Clear) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2474 

A DNA sequence (GASx423) was identified in S.pyogenes <SEQ ID 7447> which encodes the amino acid
sequence <SEQ ID 7448>. Analysis of this protein sequence reveals the following:

Possible site: 52

Final Results

bacterial membrane --- Certainty=0.1871 (Affirmative) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2475 

A DNA sequence (GASx427R) was identified in S.pyogenes <SEQ ID 7449> which encodes the amino acid 
sequence <SEQ ID 7450>. Analysis of this protein sequence reveals the following: 

Possible site: 25

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -1.17 Transmembrane 13 - 29 ( 10 - 29) 

Final Results 

bacterial membrane --- Certainty=0.1468 (Affirmative) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < suco

A related sequence was also identified in GAS <SEQ ID 9105> which encodes the amino acid sequence 
<SEQ ID 9106>. Analysis of this protein sequence reveals the following: 

Possible site: 20

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -1.17 Transmembrane 8 - 24 

Final Results 

bacterial membrane Certainty=0 . 1470 (Affirmative) < suco 

bacterial outside Certainty=0 . 0000 (Not Clear) < suco 

bacterial cytoplasm Certainty=0 . 0000 (Not Clear) < suco 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAA26616 GB:MS3917 epidermal cell differentiation inhibitor 
[Staphylococcus aureus]
Identities = 58/195 (29%) , Positives = 106/195 (53%) , Gaps = 13/195 (6%) 

Query: 67 RWGKGLI yPRAEQEAMAAYTCQQAGPINTSLDKAKGELSQLTPELRDQVAQLDAAT 122 

+WG LI Y ++ A+ YT + + IN L A G++++L +D+V +LD++ 
Sbjct: 49 KWGNKLIKQAKYSSDDKIALYEYT-KDSSKINGPLRIiAGGDINKIiDSTTQDKVRRLDSSI 107 

Query: 123 HRLVIPWNIWYRYVYETFLRDI -GVSHADLTSYYR- -NHQFDPHILCKIK- -LGTR-YT 176 

+ P ++ VYR + +L I G ++ DL + N Q+D +++ K+ + +R Y 
Sbjct: 108 SKSTTPESVYVYRLLNLDYLTSIVGFTNEDMI^^ 167 

Query: 177 KHSFMSTTALKNGAMTHRPVETOIOTCKGAKAAFV--EPYSAVPSEVELLFPRGCQLEW 234 

+ + ST + A+ RP+E+R+ + KG KAA++ + +A + E+L PRG + V 
Sbjct: 168 EDGYSSTQLVSGAAVGGRPIELRLELPKGTKAAYLNSKDLTAYYGQQEVLLPRGTSYAVG 227 



Sbjct: 228 SVELSNDKKKI I ITA 242 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2476 

A DNA sequence (GASx428) was identified in S.pyogenes <SEQ ID 7451> which encodes the amino acid
sequence <SEQ ID 7452>. Analysis of this protein sequence reveals the following: 

Possible site: 14 

>>> Seems to have no N-terminal signal sequence 



Final Results 

bacterial cytoplasm Certainty=0 .3817 (Affirmative) < suco 

bacterial membrane Certainty=0.0000 (Not Clear) < suco 

bacterial outside --- Certainty=0. 0000 (Not Clear) < suco 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2477 

A DNA sequence (GASx429) was identified in S.pyogenes <SEQ ID 7453> which encodes the amino acid 
sequence <SEQ ID 7454>. Analysis of this protein sequence reveals the following: 

Possible site: 32

>>> Seems to have a cleavable N-term signal seq. 

Final Results

bacterial outside Certainty=0 .3000 (Affirmative) < suco 

bacterial membrane Certainty=0 . 0000 (Not Clear) < suco 

bacterial cytoplasm Certainty=0 . 0000 (Not Clear) < suco 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2478 

A DNA sequence (GASx431) was identified in S.pyogenes <SEQ ID 7455> which encodes the amino acid 
sequence <SEQ ID 7456>. Analysis of this protein sequence reveals the following: 

Possible site: 43 

>>> Seems to have an uncleavable N-term signal seq 

INTEGRAL Likelihood = -8.60 Transmembrane 68 - 84 ( 66 - 90)
INTEGRAL Likelihood = -6.85 Transmembrane 22 - 38 ( 16 - 42) 
INTEGRAL Likelihood = -3.29 Transmembrane 44 - 60 ( 43 - 61) 

Final Results

bacterial membrane --- Certainty=0.4439 (Affirmative) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2479 

A DNA sequence (GASx432R) was identified in S.pyogenes <SEQ ID 7457> which encodes the amino acid 
sequence <SEQ ID 7458>. Analysis of this protein sequence reveals the following:

Possible site: 22

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0. 3000 (Affirmative) < suco 

bacterial membrane --- Certainty=0 . 0000 (Not Clear) < suco 

bacterial cytoplasm Certainty=0 . 0000 (Not Clear) < suco 

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2480 

A DNA sequence (GASx434) was identified in S.pyogenes <SEQ ID 7459> which encodes the amino acid
sequence <SEQ ID 7460>. Analysis of this protein sequence reveals the following: 

Possible site: 24 

>>> Seems to have a cleavable N-term signal seq. 

Final Results 

bacterial outside Certainty=0 . 3000 (Affirmative) < suco 

bacterial membrane Certainty=0 . 0000 (Not Clear) < suco 

bacterial cytoplasm Certainty=0 . 0000 (Not Clear) < suco 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2481

A DNA sequence (GASx435R) was identified in S.pyogenes <SEQ ID 7461> which encodes the amino acid 
sequence <SEQ ID 7462>. Analysis of this protein sequence reveals the following: 

Possible site: 25 

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -2.50 Transmembrane 4 - 20 ( 3 - 21)

Final Results

bacterial membrane Certainty=0 . 1999 (Affirmative) < suco 

bacterial outside Certainty=0. 0000 (Not Clear) < suco 

bacterial cytoplasm Certainty=0 . 0000 (Not Clear) < suco 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAB59092 GB:M97157 pyrogenic exotoxin C [Streptococcus pyogenes] 
Identities = 110/229 (48%), Positives = 150/229 (65%), Gaps = 4/229 (1%)

Query: 4
Sbjct: 6

Query: 61
Sbjct: 66

           ID Y+ K+4 + SE+

Query: 121
           N K++GNL I G 0+ L N I ++K I T QE DPKIR+YLM YKIYD SPY+ G+
Sbjct: 125 VNHKLLGNLFISGESQQNLNNKIILEKDIVTFQEIDFKIRKYLMDNYKIYDATSPYVSGR 184

Query: 181 LEIAINGNKHESFNLYDATSSSTRSDIFKKYKDNKTINMKDFSHFDIYL 229
           +EI     KHE  +L+D+ +  TRSDIF KYKDN+ INMK+FSHFDIYL
Sbjct: 185 IEIGTKDGKHEQIDLFDSPNEGTRSDIFAKYKDNRIINMKNFSHFDIYL 233

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2482 

A DNA sequence (GASx436R) was identified in S.pyogenes <SEQ ID 7463> which encodes the amino acid 
sequence <SEQ ID 7464>. Analysis of this protein sequence reveals the following: 
Possible site: 22 

>>> Seems to have a cleavable N-term signal seq. 

Final Results 

bacterial outside Certainty=0. 3000 (Affirmative) < suco 

bacterial membrane Certainty=0 . 0000 (Not Clear) < suco 

bacterial cytoplasm Certainty=0 . 0000 (Not Clear) < suco 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2483 

A DNA sequence (GASx446) was identified in S.pyogenes <SEQ ID 7465> which encodes the amino acid 
sequence <SEQ ID 7466>. Analysis of this protein sequence reveals the following: 

Possible site: 20

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < suco

bacterial membrane Certainty=0 . 0000 (Not Clear) < suco 

bacterial cytoplasm Certainty=0 . 0000 (Not Clear) < suco 

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2484 

A DNA sequence (GASx449) was identified in S.pyogenes <SEQ ID 7467> which encodes the amino acid
sequence <SEQ ID 7468>. Analysis of this protein sequence reveals the following: 

Possible site: 15 

>>> Seems to have an uncleavable N-term signal seq 
INTEGRAL Likelihood = -3.82 Transmembrane 3 - 19 ( 1 - 20)

Final Results 

bacterial membrane --- Certainty=0.2529 (Affirmative) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2485 

A DNA sequence (GASx450R) was identified in S.pyogenes <SEQ ID 7469> which encodes the amino acid 
sequence <SEQ ID 7470>. Analysis of this protein sequence reveals the following: 

Possible site: 30

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -1.44 Transmembrane 21 - 37 ( 19 - 37) 

Final Results 

bacterial membrane --- Certainty=0.1574 (Affirmative) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < suco

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2486 

A DNA sequence (GASx457R) was identified in S.pyogenes <SEQ ID 7471> which encodes the amino acid
sequence <SEQ ID 7472>. Analysis of this protein sequence reveals the following:

Possible site: 19

>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood =-15.34 Transmembrane 64 - 80 ( 57 - 

INTEGRAL Likelihood =-13.43 Transmembrane 97 - 113 ( 91 - 

INTEGRAL Likelihood = -5.57 Transmembrane 38 - 54 ( 32 - 

Final Results 

bacterial membrane --- Certainty=0.7135 (Affirmative) < suco

bacterial outside --- Certainty=0.0000 (Not Clear) < suco

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < suco



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2487 

A DNA sequence (GASx476R) was identified in S.pyogenes <SEQ ID 7473> which encodes the amino acid 
sequence <SEQ ID 7474>. Analysis of this protein sequence reveals the following: 
Possible site: 31 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3013 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2488 

A DNA sequence (GASx477) was identified in S.pyogenes <SEQ ID 7475> which encodes the amino acid 
sequence <SEQ ID 7476>. Analysis of this protein sequence reveals the following: 

Possible site: 57

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1022 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:CAC03521 GB:AJ276410 BlpJ protein [Streptococcus pneumoniae] 
Identities = 47/77 (61%) , Positives = 59/77 (76%) 

Query: 1 MIKFAEEIQKEELFHIIGGYSATDCKNHLIGGITSGAIAGGVGAGMATLGVGGVAGAFAG 60 
 M+E+EL + GGYS+TDC+N LI G+T+G I GG GAG+ATLGV G+AGAF G
Sbjct: 5 MLSQLEVMDTEMLAKVEGGYSSTDCQNALITGVTTGIITGGTGAGLATLGVAGLAGAFVG 64

Query: 61 AHVGAIAGGLTCVGGML 77
 AH+GAI GGLTC+GGM+
Sbjct: 65 AHIGAIGGGLTCLGGMV 81

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
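
The alignment summary lines (for example `Identities = 47/77 (61%)`) carry both a fraction and a rounded percentage, which makes them easy to cross-check after transcription. The sketch below is illustrative only: `parse_alignment_stats` and `percent_consistent` are hypothetical names, and the one-percentage-point tolerance is an assumption chosen to accept either rounding or truncation of the reported figure.

```python
import re

# Hypothetical helper (an assumption, not from the source document):
# matches "Identities = 47/77 (61%)", "Positives = ...", "Gaps = ...".
STATS_RE = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)"
)


def parse_alignment_stats(line):
    """Return {name: (count, aligned_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STATS_RE.findall(line)
    }


def percent_consistent(count, length, reported):
    """True if the reported percent is within one point of count/length."""
    return abs(100.0 * count / length - reported) <= 1.0


stats = parse_alignment_stats(
    "Identities = 47/77 (61%) , Positives = 59/77 (76%)"
)
all_ok = all(percent_consistent(*values) for values in stats.values())
```

A check of this kind flags transcription errors in a summary line, since a misread digit in the fraction usually no longer agrees with the percentage printed beside it.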

Example 2489 

A DNA sequence (GASx478) was identified in S.pyogenes <SEQ ID 7477> which encodes the amino acid 
sequence <SEQ ID 7478>. Analysis of this protein sequence reveals the following: 

Possible site: 45 

>>> Seems to have no N-terminal signal sequence 

INTEGRAL Likelihood = -2.07 Transmembrane 42 - 58 ( 41 - 58) 
INTEGRAL Likelihood = -1.59 Transmembrane 22 - 38 ( 22 - 38)

Final Results

bacterial membrane --- Certainty=0.1829 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAC03520 GB:AJ276410 BlpI protein [Streptococcus pneumoniae] 
Identities = 35/56 (62%), Positives = 44/56 (78%)

Query: 1 MDNFLELQFEELVNISGGKGNIGSAIGGCLGGMIiIAAAGGPITGGAAAFVCVASGI 56 

M+ F + EEL +SGG+GN+GSAIGGC+G +L+AAA GPITGGAA +CV SGI 
Sbjct: 6 MEQFSV^IJEELEIVSGGRGNLGSAIGGCIGAVLLAAATGPITGGAATLICVGSGI 61 


Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2490 

A DNA sequence (GASx482) was identified in S.pyogenes <SEQ ID 7479> which encodes the amino acid 
sequence <SEQ ID 7480>. Analysis of this protein sequence reveals the following: 

Possible site: 14 

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -0.43 Transmembrane 61 - 77 ( 61 - 79)

Final Results

bacterial membrane --- Certainty=0.1171 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAC03524 GB:AJ276410 BlpM protein [Streptococcus pneumoniae] 
Identities = 22/52 (42%) , Positives = 30/52 (57%) 


Query: 29 MEIKKLETFHQMTIEKLAKVEGGKNNWQANVSGVIAAGSAGAAIGFPVCGVA 80 

M+ K +E FH+M I L+ +EGGKNNWQ NV A G +G +C + 

Sbjct: 1 MDTKI^QFHEMDITMLSSIEGGKI^QTIIVLEGGGAAFGGWGLGTAICAAS 52 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2491 

A DNA sequence (GASx483) was identified in S.pyogenes <SEQ ID 7481> which encodes the amino acid 
sequence <SEQ ID 7482>. Analysis of this protein sequence reveals the following: 

Possible site: 58 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1832 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2492 

A DNA sequence (GASx484) was identified in S.pyogenes <SEQ ID 7483> which encodes the amino acid 
sequence <SEQ ID 7484>. Analysis of this protein sequence reveals the following: 
Possible site: 21 

>>> Seems to have a cleavable N-term signal seq. 

Final Results 

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2493 

A DNA sequence (GASx485) was identified in S.pyogenes <SEQ ID 7485> which encodes the amino acid 
sequence <SEQ ID 7486>. Analysis of this protein sequence reveals the following: 

Possible site: 32 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1037 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2494 

A DNA sequence (GASx487) was identified in S.pyogenes <SEQ ID 7487> which encodes the amino acid 
sequence <SEQ ID 7488>. Analysis of this protein sequence reveals the following: 

Possible site: 50 

>>> Seems to have no N-terminal signal sequence 

Final Results

bacterial cytoplasm --- Certainty=0.1086 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2495 

A DNA sequence (GASx488) was identified in S.pyogenes <SEQ ID 7489> which encodes the amino acid 
sequence <SEQ ID 7490>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2176 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2496 

A DNA sequence (GASx489R) was identified in S.pyogenes <SEQ ID 7491> which encodes the amino acid 
sequence <SEQ ID 7492>. Analysis of this protein sequence reveals the following: 
Possible site: 22 

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2497 

A DNA sequence (GASx490) was identified in S.pyogenes <SEQ ID 7493> which encodes the amino acid 
sequence <SEQ ID 7494>. Analysis of this protein sequence reveals the following: 

Possible site: 24 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2547 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2498 

A DNA sequence (GASx491R) was identified in S.pyogenes <SEQ ID 7495> which encodes the amino acid 
sequence <SEQ ID 7496>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood =-10.24 Transmembrane 6 - 22 ( 3 - 28)

Final Results

bacterial membrane --- Certainty=0.5097 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2499 

A DNA sequence (GASx492) was identified in S.pyogenes <SEQ ID 7497> which encodes the amino acid 
sequence <SEQ ID 7498>. Analysis of this protein sequence reveals the following: 

Possible site: 27 

>>> Seems to have an uncleavable N-term signal seq 

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2500 

A DNA sequence (GASx493) was identified in S.pyogenes <SEQ ID 7499> which encodes the amino acid 
sequence <SEQ ID 7500>. Analysis of this protein sequence reveals the following: 

Possible site: 19 

>>> Seems to have no N-terminal signal sequence 

INTEGRAL Likelihood = -0.69 Transmembrane 21 - 37 ( 21 - 37) 

Final Results

bacterial membrane --- Certainty=0.1277 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2501 

A DNA sequence (GASx495R) was identified in S.pyogenes <SEQ ID 7501> which encodes the amino acid 
sequence <SEQ ID 7502>. Analysis of this protein sequence reveals the following: 

Possible site: 28 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2891 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2502 

A DNA sequence (GASx499R) was identified in S.pyogenes <SEQ ID 7503> which encodes the amino acid 
sequence <SEQ ID 7504>. Analysis of this protein sequence reveals the following: 

Possible site: 15 

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -2.50 Transmembrane 3 - 19 ( 1 - 20)

Final Results

bacterial membrane --- Certainty=0.1999 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2503 

A DNA sequence (GASx500) was identified in S.pyogenes <SEQ ID 7505> which encodes the amino acid 
sequence <SEQ ID 7506>. Analysis of this protein sequence reveals the following: 

Possible site: 54

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAC77220 GB:AE000497 orf, hypothetical protein [Escherichia coli]

Identities = 262/480 (54%) , Positives = 338/480 (69%) , Gaps = 5/480 (1%) 

Query: 18 GMLNRHGLIAGATGTGKTOTLKVIAEQLSIjAGWVFI^^ 77 

GM NRHGLI GATGTGKTVTL+ LAE LS GVPVF+AD+ KGDL+ + +AG V++KL AR 
Sbjct: 20 GMANPJ1GLITGATGTGKTWLQIUAESLSEIGVPVFMADVKGDLTGVAQAGTVSEKLLAR 79 

Query: 78 lATIGVSDYQPQAFPVRMWDVFGQNGQPLRTTISELGPmLSRLLNLNDTQTGVLNIVFK 137 
 L IGV+D+QP A PV +WD+FG+ G P+R T+S+LGP++L+RLLNEND Q+GVLNI+F+ 

Sbjct: 80 LKNIGVNDWQPHANPVWWDI FGEKGHPVRATVSDLGPLLIARLLNIiNDVQSGVLNI I FR 139 

Query: 138 IADEKGWLLIDLKDLQAILKEVGDHASDYSSHYGNIAKQSIGAIQRSLLTLEQEGAHQFF 197 
IAD++G LL+D KDL+AI + +GD+A + + YGNI+ S+GAIQR LL+LEQ+GA FF 
Sbjct: 140 IADDQGLLLLDFKDLRAITQYIGDNAK3 FQNQYGN- SSASVGAIQRGLLSLEQQGAAHFF 199 

Query: 198 GEPALDVADLMQLDVASGYGAINILSATKLFQSPTLYTTFLLWLLSELYKLDPEVGDLDK 257 

GEP LD+ D M+ D A+G G INILSA KL+Q P LY LLW+LSELY+ LPE GDL+K 
Sbjct: 200 GEPMLDIKDWMRTD-ANGKGVINILSAEKLYQMPKLYAASLLWMLSELYEQLPEAGDLEK 258 


Query: 258 PKMVFFFDEAHLLFKDAPKVFLEKVEQITOLIRSKGVGIFFVTQNPLDLPETVLAQLGNR 317 

PK+VFFFDEAHLLF DAP+V L+K+EQ++RLIRSKGVG++FV4QNP D+P+ VL QLGNR 
Sbjct: 259 PKLVFFFDEAHLLFNDAPQVLLDKI EQVI RL I RS KGVGVWFVSQNPSD I PDNVLGQLGNR 318 

Query: 318 IQHAFPA.YTPKEQiaWVAADTFRQNPDLDVARVTTELEVGFJU^ISVLNDKGQPSIVERA 377 

+QHA RA+TPK+QKAV+ AA T R NP D + I EL GEALIS L+ K5 PS+VERA 
Sbjct: 319 VQHALFAFTPKIJQKAvKAAAQTMRANPAFDTEKAIQELGTGEALISFLDAKGSPSVVERA 378 

Query: 378 YIMPPKSSFAVLSEIESQQLVQSSPFASKYSQSIDRESAYEKLAAKVLEDNRLAQEAIAT 437 
 ++ P S ++E E L+ SP KY +DRESAYE L K + + Q 

Sbjct: 379 MVIAPCSRMGPVTEDERNGLINHSPVYGKYEDEVDRESAYEML-QKGFQASTEQQNNPPA 437 



Query: 438 AQREKEAKEAIKACjAATKKANRRSVGRSHKTVVEKATDAFISTTVRTIGRELVRGLLGSL 497 
Sbjct: 438 KGKEVAVDDGILGGLKDILFGTTGPRGGKK-- 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2504 

A DNA sequence (GASx502) was identified in S.pyogenes <SEQ ID 7507> which encodes the amino acid 
sequence <SEQ ID 7508>. Analysis of this protein sequence reveals the following: 

Possible site: 49 

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood =-13.59 Transmembrane 59 - 75 ( 52 - 77)
INTEGRAL Likelihood = -9.34 Transmembrane 4 - 20 ( 1 - 24)

Final Results

bacterial membrane --- Certainty=0.6434 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MYNLLLTILLVLSGLLEIAIFMQPQIvNPSSNVFDSSGSEALFERT:<ARGFEAFMQRFTAV 60 

M+ +L+T+L+++S L I + +Q K+ + S G+E LF + KARG + + R T V 
Sbjct: 1 MHAVLITLLVIVSIALIIVVLLQSSKSAGLSGAISGGAEQLFGKQKARGLDLILHRITVV 60 

Query: 61 L--VFFWLAIAL 70 

L +FF L IAL 
Sbjct: 61 LAVLFFVLTIAL 72 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
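
The `INTEGRAL` lines list each predicted transmembrane segment with its likelihood score and residue range. As a rough sketch only (the function name `parse_tm_segments` is hypothetical, and the regular expression assumes complete, cleanly transcribed lines), the segments can be collected and ordered along the sequence like this:

```python
import re

# Hypothetical helper (an assumption, not from the source document):
# matches "INTEGRAL Likelihood =-13.59 Transmembrane 59 - 75 ( 52 - 77)".
TM_RE = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?\d+\.\d+)\s+Transmembrane\s+(\d+)\s*-\s*(\d+)"
)


def parse_tm_segments(text):
    """Return (likelihood, start, end) tuples sorted by start residue."""
    segments = [
        (float(score), int(start), int(end))
        for score, start, end in TM_RE.findall(text)
    ]
    return sorted(segments, key=lambda segment: segment[1])


example = """
INTEGRAL Likelihood =-13.59 Transmembrane 59 - 75 ( 52 - 77)
INTEGRAL Likelihood = -9.34 Transmembrane 4 - 20 ( 1 - 24)
"""
segments = parse_tm_segments(example)
```

Sorting by start residue puts the segments in sequence order rather than in the order the lines are printed. In this example both predicted segments span 17 residues.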

Example 2505 

A DNA sequence (GASx505) was identified in S.pyogenes <SEQ ID 7509> which encodes the amino acid 
sequence <SEQ ID 7510>. Analysis of this protein sequence reveals the following: 

Possible site: 45 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -1.44 Transmembrane 140 - 156 ( 138 - 156)

Final Results

bacterial membrane --- Certainty=0.1574 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAF09704 GB:AE001874 glutamine cyclotransferase [Deinococcus radiodurans] 
Identities = 81/229 (35%) , Positives = 128/229 (55%) , Gaps = 10/229 (4%) 

Query: 16 YS YDSNLYTQGLEQLNNNHILLSAGRYGFSKVGVYDL - - TQE I FSEKIAFP - DTVFAEGL 72 
 Y +D +TQGL+ L HLSG+GS+V+L + 4+S +A F EG 

Sbjct: 54 YPHDRAAFTQGLQYLGGGHYLESTGQVGESDLRVSELRGAKVLWSTPIAQALPQAFGEGS 113 

Query: 73 TVVEDYFWLLTYKEGVAYKFDKATCNCLGAYPFEGIX3MGLAYDKENQCLWMTSGNAFLQK 132 
 T + + LT+++GVA +D T G + ++G+GWGL D ++ L M++G + L 

Sbjct: 114 TQLGSTVYQLTWQDGVALTYDARTFKETGRHRYQGEGWGLTSDGKS - - LIMSNGTSTLVW 171 

Query: 133 RDPI03FALLDTVLVAIESVPISffl^LEYVIX3YLYANIWQTNTIVKLQPDSGKWATYDI 192 
RDPK FA +V V + P+ LNELEYV G +YAN+W T+ I ++ P +GKV+ D+ 
Sbjct: 172 RDPKTFAAQRSVQOTDQGQPVRffi^LEWQGSVYM^LTDRIARIHPQTGKVLTWIDV 231 

Query: 193 SPLLKALNLDKSHYPDL NVLNGIAHLDQQ-RFLITGKLYPLMLEV 236 

S L + ++ + +V NGIA + ++ L+TGK +P + EV 

Sbjct: 232 SDLTREVSAAATKQGQALTFDDVPNGIAFIPERGTLLLTGKRWPTLFEV 280 


Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2506 

A DNA sequence (GASx506R) was identified in S.pyogenes <SEQ ID 7511> which encodes the amino acid 
sequence <SEQ ID 7512>. Analysis of this protein sequence reveals the following: 

Possible site: 33 

>>> Seems to have no N-terminal signal sequence 

Final Results

bacterial cytoplasm --- Certainty=0.2800 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2507 

A DNA sequence (GASx507R) was identified in S.pyogenes <SEQ ID 7513> which encodes the amino acid 
sequence <SEQ ID 7514>. Analysis of this protein sequence reveals the following: 
Possible site: 53 

Final Results

bacterial membrane --- Certainty=0.5203 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB56669 GB:AL121596 putative membrane protein [Streptomyces coelicolor A3(2)]

Identities = 119/322 (35%), Positives = 182/322 (55%), Gaps = 24/322 (7%) 

Query: 9 LETIYILIGLQLFHTAYCTFKDKTWPVyBXSTALFWGLLGVTFV GGAFL 56
 +E +Y LIGL A D++NP + +A FWGLLG+TF GG L
Sbjct: 4 VEWLYWLIGLVFVVmVQ^DRSNPKRWTSAAFWGLLGLTFPYGTGVANATAGNGGWTL 63

Query: 57 PNKVIGFIVIVIALLTLFKQWIGTLPAFNEQKAEESAHRIGNWIFLPVMLMAMISLLLA 116
 P + +G V+ L +L F + G ++ E +A R+GN IF+P + + +++++ A
Sbjct: 64 PAEPLGVAVLALIVLAGFNFLGKGVPOTTTGEQREAAAARU3NKIFVPALTIPLVAIVCA 123

Query: 117
Sbjct: 124

Query: 168 QLLGALGAIFAAAGVGDVIASLIREIVPADSRFFGVLAYVLGMVIFTMIMGNAFAAFTVI 227
 QLL LG+ IFAAAGVGD + ++ +++P DS++F VLAY +GM +FT+IMGNAFAAF V+
Sbjct: QLLAVLGSIFAAAGVGDQVGDIMNKVLPDDSKYFAVIAYCVGMFLFTVIMGNAFAAFPVM 243

Query: 228
 A+ M AGF GTL TPMAANFN +P L+E+KD+
Sbjct: 244

Query: 286
Sbjct: 303 AQLPTGIALLGCCTVIMALFAF 324

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 



Example 2508 

A DNA sequence (GASx508R) was identified in S.pyogenes <SEQ ID 7515> which encodes the amino acid 
sequence <SEQ ID 7516>. Analysis of this protein sequence reveals the following: 

Possible site: 61 

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood =-12.15 Transmembrane 212 - 228 ( 208 - 2

INTEGRAL Likelihood = -8.81 Transmembrane 23 - 39 ( 17 - 

INTEGRAL Likelihood = -7.43 Transmembrane 45 - 61 ( 40 - 

INTEGRAL Likelihood = -1.49 Transmembrane 114 - 130 ( 114 - 

INTEGRAL Likelihood = -1.49 Transmembrane 3 - 19 ( 3 - 

INTEGRAL Likelihood = -1.49 Transmembrane 76 - 92 ( 76 - 

Final Results

bacterial membrane ---

bacterial outside ---

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:CAB56670 GB:AL121596 possible integral membrane protein [Streptomyces coelicolor A3(2)]
Identities = 77/220 (35%), Positives = 138/220 (62%), Gaps = 2/220 (0%) 

Query: 23 IKLIGIVIIVLGFILKCDAIATVWAGLVTALVSGISFIDFLDILGKEFTNQRLLTIFFI 82 

I L+G+V+++LGF+ + + + V VAG+VT L+ ++ ++ L G+ F + R +T++ I 
Sbjct: 2 IVLLGWWILGFVTRRNPVLWGVAGIVTGLLGKMNPLEVIJ^FGRSFADSRSVTVYAI 61 

Query: 83 TLPLIGLSETYGLKHRATQLIQRVQALTVGRLLTLYLIIRELAGLFSIR-LGGHPQFVRP 141 
 LP+IGL E YGL+ +A LI R+ L+ GR LT+YL++R++ F + +GG Q VRP 

Sbjct: 62 VIjPVIGLLERYGLRECjARHLIGRLGKLSAGRFLTVYLLVRQTCAAFGLNSIGGPAQTVRP 121 

Query: 142 LIQPMGEaAAKAMIGEELTDAEKDDIKflMRAANENFGNFFAQNTFVGRGGVLLIASTLEQ 201 

L+ PM EAAA+ + G +L D ++ +++ +A+ + G FF ++ F+ G +LLI G 4 
Sbjct: 122 LVAPMAFJWffiRSTGAKLPDKLREKVRSYSASADTVGVFFGEDCFIAIGSILLITGFVNS 181 

Query: 202 LGY- DGNQAKIAFSS I LIAI I S 1 1 I VAI YNYLFEKKMERQ 240 

+ D ++A +1 +A+ + +1 L +K++ER+ 

Sbjct: 182 TYHQDIEPTQLALWAIPLAVCAFLIHGARLLLMDKQLERE 221 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2509 

A DNA sequence (GASx520) was identified in S.pyogenes <SEQ ID 7517> which encodes the amino acid 
sequence <SEQ ID 7518>. Analysis of this protein sequence reveals the following: 

Possible site: 13 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2652 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2510 

A DNA sequence (GASx522R) was identified in S.pyogenes <SEQ ID 7519> which encodes the amino acid 
sequence <SEQ ID 7520>. Analysis of this protein sequence reveals the following: 

Possible site: 21 

>>> Seems to have an uncleavable N-term signal seq 

Final Results 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2511 

A DNA sequence (GASx523) was identified in S.pyogenes <SEQ ID 7521> which encodes the amino acid 
sequence <SEQ ID 7522>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2133 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2512 

A DNA sequence (GASx525) was identified in S.pyogenes <SEQ ID 7523> which encodes the amino acid 
sequence <SEQ ID 7524>. Analysis of this protein sequence reveals the following: 

Possible site: 14

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2364 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2513 

A DNA sequence (GASx535) was identified in S.pyogenes <SEQ ID 7525> which encodes the amino acid 
sequence <SEQ ID 7526>. Analysis of this protein sequence reveals the following: 

Possible site: 47 

>>> Seems to have no N-terminal signal sequence 

Final Results

bacterial cytoplasm --- Certainty=0.4223 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2514 

A DNA sequence (GASx536) was identified in S.pyogenes <SEQ ID 7527> which encodes the amino acid 
sequence <SEQ ID 7528>. Analysis of this protein sequence reveals the following: 

Possible site: 59

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1102 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAB85515 GB:AE000874 conserved protein [Methanobacterium thermoautotrophicum]
Identities = 82/236 (34%) , Positives = 132/236 (55%) , Gaps = 11/236 (4%) 


Query: 9 MNLSIFGLKNIPYLKEGDSIEKLIEESIKTSEFFIEDNDVLCIASKWSIAEGQVMSLNE 68 
M +S+ G++ +Pt GDI LI +t + D D++ IA +VS AEG ++SL E 

Query: 69 IQVSDVAKEIHENIPRKDPRIIEIMIoKLVNRDLSRLDIKKNYIGCRLENGLKLTSGGIDR 128 
1+ S A +1 KDP ++E +L + + ++I +G + GID 

Sbjct: 61 I KPSPEALD IAERTG - KDPSLVEAI LG ESSEI IR VGHDFIVSETRHGFVCANAGIDE 116 

Query: 129 KSVDEVFL- -LPNNPDASAKRISEYLKKSLGKNVAWITDSDGREDKRGATQVAIGIYGI 186 
 +VD+ LP +PD SA++I L+4-+ G+ +AV+I+D+ GR + GA VA+G+ G+ 

Sbjct: 117 SNVDDGLATPLPRDPDGSAEKILRTLQEATGRELAVIISDTQGRPFREGAVGVAVGVAGL 176 

Query: 187 HPL--RKTEVIDSQGETIKFQEETLCDMIAACAGLVMGQRGTGIPAVL1RGLDYKW 240 
P+ RK E D G +++ + D +AA A LVMGQ G+PAV+IRG Y W 

Sbjct: 177 SPIWDRKGE-RDLYGRSLETTRVAVADEIAAAASLVMGQADEGVPAVIIRG--YPW 229 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2515 

A DNA sequence (GASx537) was identified in S.pyogenes <SEQ ID 7529> which encodes the amino acid 
sequence <SEQ ID 7530>. Analysis of this protein sequence reveals the following: 

Possible site: 50 

>>> Seems to have no N-terminal signal sequence 

174 - 190 ( 174 - 190) 



Final Results 

bacterial membrane --- Certainty=0.1447 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2516 

A DNA sequence (GASx538) was identified in S.pyogenes <SEQ ID 7531> which encodes the amino acid 
sequence <SEQ ID 7532>. Analysis of this protein sequence reveals the following: 

Possible site: 32

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3852 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAB99212 GB:U67562 conserved hypothetical protein [Methanococcus jannaschii]

Identities = 129/387 (33%) , Positives = 208/387 (53%) , Gaps = 44/387 (11%) 

Query: 18 EWERKGLGHPDTLADGIAEQIEIDYSLYCLDKFGVIPHHNFDK1IIRGGHSVQDFGGSD 77 

E+VERKGLGHPD++ DGI7AE + ++KFG I HHN D++ + GGH+ FGG 

Sbjct: 20 EIvERKGLGHPDSICDGIAESVSPJULCKMYMEKFGTILHKNTDQVELVGGHAYPKFGGGV 79 

Query: 78 FIEPIKIIFLGRASKKCFNS SIPLFKIQKKAATKYLNRILPNLDVENYVEFETL 131 

+ PI 1+ GRA+ + + +P+ KAA +YL ++L N+DV+ V + 

Sbjct: 80 MVSPIYILLSGRATMEILDKEKNEVIKLPVGTTAVKPAKEYLKKVLRNVDVDKDVIID-- 137 

Query: 132 TSDFTTKTNWFSPEAIEDLP - EYLDVPKANDTATMISYWPLTI SEEiALMIEGYFYKLD - 189 

+ S + ++ + +VP ANDT+ + Y PL+ +E L L E + + 
Sbjct: 138 CRIGQGSMDLVDVFERQKNEVPLANDTSFGVGYAPLSTTERLVLETERFLNSDEL 192 

Query: 190 KNELPTPRFTK^GDIKVMVVRNDrjEYSIRINFPLISKFFl^IDIESQLYVDKHWKIKKY 249 

KNE+P +G DIKVM +R + ++ I ++ ++ N IE V +EK++K 
Sbjct: 193 KNEIPA VGEDIKVMGLREGKKITLTIAMAWDRYVKN- IEEYKEV IEKVRKK 243 

Query: 250 IEQKYKNIS--FSIDYH -YYLTTTGSCIDFGEEGAVGRGNKTHGI I SSFR 296 

+E K 1+ + ++ H YLT TG+ + G++G+VGRGN+ +G+I+ FR 

Sbjct: 244 VEDLAKKIADGYEVEIHINTADDYERESVYLTVTGTSAEMGDDGSVGRGNRVNGLITPFR 303 

Query: 297 PNTMFAPAGKNCTYF^/GKVWGFLSDTIAKEIYEAFNT-PCQIIMQLNIGSKLYRPTHLFI 355 

P +MEA +GKN VGK++ L++ IA +1 + C + + IG + P L I 

Sbjct: 304 PMSMEAASGKNPVNHVGKIYNILANLIANDIAKLEGVKECYVRILSQIGKPINEPKALDI 363 

Query: 356 Q--TEESVD QERVLEIVNRHIiNN 376 

+ TE+S D + + EI N+ L+N 
Sbjct: 364 EIITEDSYDIKDIEPKAKEIANKWLDN 390 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
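
The alignment summaries in these examples (Identities, Positives, Gaps) can be checked mechanically, which helps to spot figures damaged in transcription. The following is a minimal sketch and not part of the original analysis. It assumes the summary line has the form shown in Example 2516; the names `parse_summary` and `inconsistent_fields` are hypothetical, and the one-point tolerance is an assumption that allows for rounding or truncation of the printed percentage.

```python
import re

# One "Name = count/length (percent%)" field of an alignment summary line.
SUMMARY_RE = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)\s*\(\s*(\d+)\s*%\s*\)"
)


def parse_summary(line):
    """Return {field: (count, length, printed_percent)} for one summary line."""
    return {
        m.group(1): (int(m.group(2)), int(m.group(3)), int(m.group(4)))
        for m in SUMMARY_RE.finditer(line)
    }


def inconsistent_fields(summary, tolerance=1.0):
    """List fields whose printed percent is more than `tolerance` points
    away from 100 * count / length (assumed tolerance, see lead-in)."""
    bad = []
    for name, (count, length, printed) in summary.items():
        actual = 100.0 * count / length
        if abs(actual - printed) > tolerance:
            bad.append(name)
    return bad


# Summary line of Example 2516.
line = "Identities = 129/387 (33%), Positives = 208/387 (53%), Gaps = 44/387 (11%)"
summary = parse_summary(line)
print(summary)
print(inconsistent_fields(summary))
```

A field that fails to parse at all (for example, a denominator containing a letter) is simply absent from the returned dictionary, which is itself a useful signal that the line needs manual review.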

Example 2517 

A DNA sequence (GASx539) was identified in S.pyogenes <SEQ ID 7533> which encodes the amino acid 
sequence <SEQ ID 7534>. Analysis of this protein sequence reveals the following: 

Possible site: 17 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1436 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
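
The localisation predictions reported under "Final Results" follow a fixed three-row pattern, so they can be tabulated mechanically. The following is a minimal sketch and not part of the original analysis. It assumes rows of the kind shown for Example 2517; the names `parse_final_results` and `predicted_location` are hypothetical, and the pattern deliberately tolerates stray spaces inside the numbers and a missing or altered separator.

```python
import re

# "bacterial <location> [separator] Certainty=<score> (<call>)"
ROW_RE = re.compile(
    r"bacterial\s+(\w+)\W*Certainty\s*=\s*([0-9. ]+?)\s*\(([^)]*)\)"
)


def parse_final_results(text):
    """Return {location: (certainty, call)} for every result row in `text`."""
    results = {}
    for m in ROW_RE.finditer(text):
        score = float(m.group(2).replace(" ", ""))  # "0. 1436" -> 0.1436
        results[m.group(1)] = (score, m.group(3).strip())
    return results


def predicted_location(results):
    """Return the location with the highest certainty value."""
    return max(results, key=lambda loc: results[loc][0])


# Result rows of Example 2517.
sample = """
bacterial cytoplasm --- Certainty=0.1436 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
"""
results = parse_final_results(sample)
print(results)
print(predicted_location(results))
```

When every row is reported as 0.0000 (Not Clear), as happens in a few examples, `predicted_location` still returns one of the locations, so a caller should check the certainty value before relying on it.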

Example 2518 

A DNA sequence (GASx540) was identified in S.pyogenes <SEQ ID 7535> which encodes the amino acid 
sequence <SEQ ID 7536>. Analysis of this protein sequence reveals the following: 

Possible site: 45 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.3956 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAD36304 GB:AE001779 conserved hypothetical protein [Thermotoga maritima] 
Identities = 105/353 (29%), Positives = 173/353 (48%), Gaps = 46/353 (13%) 

Query: 3 VIGIPTLNEM3NISRLVKQIDEYAVNL-GKEIIIINSDSKSTDGTPQIFLETKTYNT-KV 60 
         V+GIP+ N A+ IS + + + V+ + +I+NSD S DGT + F+ET T+ K 
Sbjct: 106 WGIPSYNNAETISHVARTAAQGIVDFFDGDGKIVNSDGGSADGTRERFMETDTFGLPKE 165 

Query: 61 SIVSFA-KGKGYWVRNIFEYAINHVPNFSGLILIDGDWSMKKMWLEKMFIAIESGN-DL 118 
          S V E GKG +R I E+A+ + ++ +D D+ S+K W+E++ + G D 
Sbjct: 166 SFVYEGLPGKGSAMRAIMEFALKQ--DAEAWFLDADLRSWPWWVERLAGPVLKGEADY 223 

Query: 119 IIPNYARKSFEGNATNHFIYPMLVKIFKRDMPYQCISGDFGFSRGLIKDLTLKCN--WHK 176 
           + P Y R F+G TN+ +PM ++ + + Q I GDFG R L++ K W+ 
Sbjct: 224 OTPFYLRHRFDGTITNNVCFPMTAVLYGKKVR-QPIGGDFGVGRKLLEIYLGKPKEIWNT 282 

Query: 177 YTLGYGIDIFLTLTAILKSYKIKEIDLQSKIH- - KKSFEKIEKI FLEVSQS FFETINDNS 234 
           +GIDI++T TAI +S ++ + L +K>H K + ++ +FL+V + FE + 
Sbjct: 283 DVARFGIDIWMTTTAINESGRWQAALGTKVHDVKDPGKHLKGMFLQWGTLFELV 338 

Query: 235 LNQDKLRLNINFESHSRQFIKSSDI LSSNDIENLKLRALFLLQEEKQY 282 
           I +E+ ++ K D+ S DI NLK A L+E + 
Sbjct: 339 ITYENvWKEIWKIEDVPIYGETPQEEVPSMSIDIGNLia<LARETLEEVEYI 389 

Query: 283 LHG-LSEVEWDGI - -LSNTINNIYRYSSEEHSL YLLPLYLLRVYNY 325 
           G LSEV4- G LS+ ++ 4YR + + LLP Y R + 
Sbjct: 390 DRGILSEVKESGTLSLSSWVDTLYRSAVQYRKTRDKKWENLLPFYFARTARF 442 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2519 

A DNA sequence (GASx542) was identified in S.pyogenes <SEQ ID 7537> which encodes the amino acid 
sequence <SEQ ID 7538>. Analysis of this protein sequence reveals the following: 

Possible site: 20 

Final Results 

bacterial membrane --- Certainty=0.3123 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB07427 GB:AP001519 nucleotide sugar dehydrogenase [Bacillus halodurans] 
Identities = 184/388 (47%) , Positives = 274/388 (70%) , Gaps = 3/388 (0%) 

Query: 1 MKITWGIGYVGLS IGLLLAKEHD VTFFD IDNKKI DLINKRQS PLKFAAINKLLC - KAKN 59 
         M IT-t- G GYVGLS +LLA+ +DV +DI +K+D+IN R+SP+ + I + L K N 
Sbjct: 1 tOTITIAGTGWGLSNAVLIiAQHNDVIAYDIVQEKVDMINNRKSPIVDREIEEFLATKELN 60 

Query: 60 IHATSSEELAYKDATFIILSLPTNL--KFNKLDTSIIEISVSNILKINKKATIVIKSTVP 117 
          + AT+ +E A+KDA F+++S PTN + N DTS +E +S++L IN A +VIKST+P 
Sbjct: 61 LTATTDKEKAFKDAQFWISTPTNYDPEKNYFDTSSVEAVISDVLSINPNAVMVIKSTIP 120 

Query: 118 IGFTEYLRNRFHYNDIIFSPEFLREGSTIHDQLYPSRTIVGNESRNSQLFLDILTDISVE 177 
           +G+T + RF+ +IIFSPEFLREGS ++D L+PSR +VG ++ +++F +L +++ 
Sbjct: 121 VGYTREVNERFNTKNI I FSPEFLREGSALYDNLHPSRI WGERTQRAKI FAALLVQGAIK 180 

Query: 178 KDSPSLLVGSSEAEAIKLFSNAYLAQKIAFFNELDTFAEMQNLDSKKIIEAMGYDQRIGN 237 
           ++ L S+EAEAIKLF+N YLA ++AFFNELD++AE++ LD+K+II+ +G D RIG 
Sbjct: 181 ENIDVLFTDSTEAEAIKLFANTYLAMRVAFFNELDSYAELKGLDAKQIIDGVGLDPRIGT 240 

Query: 238 SHNNPSFGFGGYCLPKDIKQLEYHFKEIPAPIITSISESNLLRKIHIAKMIIiNSSAKTIG 297 
           +NNPSFG+GGYCLPKD KQL 4F+++P II +1 ++N RK H+A MIL EC +G 
Sbjct: 241 HYNNPSFGYGGYCLPKDTKQLLANFEDVPNNIIGAI VDANDTRIODHVANMILKREPKVVG 300 

Query: 298 IYRINSKKDSDNCRESSTIDVAKLLKSSGKDVIIFEPLINQKKFLGCPLSNDFNEFIKYS 357 
           IYR+ K SDN R+S+ +DV L ++G +V+++EP ++ +F G + DF EF K S 
Sbjct: 301 

Query: 358 
           D+IVANR+ D L++ KV+TRD++ D 
Sbjct: 361 DVIVANRLSDDLKEVAEKVYTRDLYTRD 388 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2520 

A DNA sequence (GASx544R) was identified in S.pyogenes <SEQ ID 7539> which encodes the amino acid 
sequence <SEQ ID 7540>. Analysis of this protein sequence reveals the following: 

Possible site: 34 



Final Results 

bacterial membrane --- Certainty=0.1022 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2521 

A DNA sequence (GASx545R) was identified in S.pyogenes <SEQ ID 7541> which encodes the amino acid 
sequence <SEQ ID 7542>. Analysis of this protein sequence reveals the following: 

Possible site: 58 

>>> Seems to have no N-terminal signal sequence 

INTEGRAL Likelihood = -1.49 Transmembrane 186 - 202 ( 186 - 203) 

Final Results 

bacterial membrane --- Certainty=0.1595 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2522 

A DNA sequence (GASx546R) was identified in S.pyogenes <SEQ ID 7543> which encodes the amino acid 
sequence <SEQ ID 7544>. Analysis of this protein sequence reveals the following: 

Possible site: 47 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.2422 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2523 

A DNA sequence (GASx547R) was identified in S.pyogenes <SEQ ID 7545> which encodes the amino acid 
sequence <SEQ ID 7546>. Analysis of this protein sequence reveals the following: 

Possible site: 50 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1612 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2524 

A DNA sequence (GASx548) was identified in S.pyogenes <SEQ ID 7547> which encodes the amino acid 
sequence <SEQ ID 7548>. Analysis of this protein sequence reveals the following: 

Possible site: 44 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.5156 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2525 

A DNA sequence (GASx549R) was identified in S.pyogenes <SEQ ID 7549> which encodes the amino acid 
sequence <SEQ ID 7550>. Analysis of this protein sequence reveals the following: 

Possible site: 21 

>>> Seems to have a cleavable N-term signal seq. 

Final Results 

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2526 

A DNA sequence (GASx552) was identified in S.pyogenes <SEQ ID 7551> which encodes the amino acid 
sequence <SEQ ID 7552>. Analysis of this protein sequence reveals the following: 

Possible site: 15 

10 N-terminal signal sequence 

83 - 93 ( 83 - 99) 



Final Results 

bacterial membrane --- Certainty=0.1235 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 






The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2527 

A DNA sequence (GASx553) was identified in S.pyogenes <SEQ ID 7553> which encodes the amino acid 
sequence <SEQ ID 7554>. Analysis of this protein sequence reveals the following: 

Possible site: 49 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.2781 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2528 

A DNA sequence (GASx554) was identified in S.pyogenes <SEQ ID 7555> which encodes the amino acid 
sequence <SEQ ID 7556>. Analysis of this protein sequence reveals the following: 

Possible site: 18 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.2792 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2529 

A DNA sequence (GASx555) was identified in S.pyogenes <SEQ ID 7557> which encodes the amino acid 
sequence <SEQ ID 7558>. Analysis of this protein sequence reveals the following: 

Possible site: 35 

>>> Seems to have no N-terminal signal sequence 

INTEGRAL Likelihood = -0.00 Transmembrane 49 - 65 ( 49 - 65) 

Final Results 

bacterial membrane --- Certainty=0.1001 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAA36631 GB:AB016282 ORF25 [bacteriophage phi-105] 
Identities = 43/118 (35%), Positives = 69/118 (58%), Gaps = 2/118 (1%) 

Query: 3 IiDLIGRKRARDKPQNSYEGQDFSYIiFG- -RTTSGEfA/DEFKTMQTTAWACVRVLAEAV 60 

LL+ + KR+ +FG +T SGE V E ++ ++ACV VL+4 + 

Sbjct: 2 LLERMFEKRSGSSDHEDGFNNILIMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDI 61 


Query: 61 ASLPIHIYERTENGKEKKLDHPLYFLLHDEPNPEMSSFIFRETIMSHLLIWGNAYVQI 118 

A LPIH Y+RT+ G E+K +H ++ PNP M++F +++ +M+H+L WGNAY 1 

Sbjct: 62 AKLPIHTYKRTDGGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYI 119 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2530 

A DNA sequence (GASx556) was identified in S.pyogenes <SEQ ID 7559> which encodes the amino acid 
sequence <SEQ ID 7560>. Analysis of this protein sequence reveals the following: 

Possible site: 43 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.2055 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2531 

A DNA sequence (GASx557) was identified in S.pyogenes <SEQ ID 7561> which encodes the amino acid 
sequence <SEQ ID 7562>. Analysis of this protein sequence reveals the following: 

Possible site: 50 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1696 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2532 

A DNA sequence (GASx559) was identified in S.pyogenes <SEQ ID 7563> which encodes the amino acid 
sequence <SEQ ID 7564>. Analysis of this protein sequence reveals the following: 

Possible site: 51 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1556 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



+++ R K++MR 

Query: 68 
Sbjct: 87 
Query: 126 
Sbjct: 147 
Query: 185 
Sbjct: 207 
Sbjct: 260 
Query: 293 
Sbjct: 318 

PLI+KPT +SIGV 

LYNLEPRVLVEEFIDGE EY-SLESWRNGIYTP 232 
E + EEF+ GE +Y S+E ++ +G Y P 
- -TFEAPFIAEEFLQGEYGDWYQTEGYSDYISIEGIMADGEYFP 259 

LIE AR G + IPN+ K 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2533 

A DNA sequence (GASx561) was identified in S.pyogenes <SEQ ID 7565> which encodes the amino acid 
sequence <SEQ ID 7566>. Analysis of this protein sequence reveals the following: 

Possible site: 55 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.2602 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2534 

A DNA sequence (GASx562) was identified in S.pyogenes <SEQ ID 7567> which encodes the amino acid 
sequence <SEQ ID 7568>. Analysis of this protein sequence reveals the following: 

Possible site: 34 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAD06696 GB:AE001539 HISTIDYL-TRNA SYNTHETASE [Helicobacter pylori J99] 
Identities = 75/309 (24%), Positives = 129/309 (41%), Gaps = 35/309 (11%) 

Query: 11 KGYRRQFNQ I LLGAWG I ESAYVDAEI IVATWRGLQRFKGI KVE - - FI QLSNKNI FDVLEK 68 

KG R+F Q G ES DAEII L K + +E + + +++ I + + + 
Sbjct: 115 KGRYREFTQCDFDFIGSESLVCDAEIIQVIIASL KALDLEDFCVSINHRKILNGICE 171 

Query: 69 DLSKKLRFEDISIEAILGKYLCNNDIEIIKCLYEKDKINMELLISLISKISNKLVKQEFI 128 

E. + I L K N E +K + D ++ L+ ++ N L EF 
Sbjct: 172 YFGIAQVNEVLRIVDKLEKIGLNGVEEELKKECDLDSNTIKDLLEMVQIKQNDLSHAEFF 231 

Query: 129 -KVLVLYEYVKNPLP VDCIYPSLS NLY GTGHYSSMNYKIFIR 169 

K+ L +Y +N ++ +Y L NLY G G+Y+ + Y+ + 

Sbjct: 232 EKIAYLKDYNENLKKGIQDLERLYQLLGDLQISQNL.YKIDFSIARGLGYYTGIVYETTLN 291 

Query: 170 TKSGDIFDIADGGRIDDMVSKFNKVNVLGVCMGIGTTVLSQEI EYEIEDRIMI 222 

+ + GGR D + F+K N+ GV IG L + E + +++I 

Sbjct: 292 DMKS-LGSVCSGGRYDHLTKNFSKENLQGVGASIGIDRLIVALSEMQLLDERSTQAKVLI 350 

Query: 223 LVEKIDVKIYKNCIjELANKLSGYHCSVFEFPYKKIKKFFKHELYSRHHYIITOLDGSMEY 282 

+ Y N L + + SG V+ +KIKK F + + H ++ V G E+ 
Sbjct: 351 ACMHEEYFSYANRLAESLRQSGIFSEVYP- EAQKI KKPFSYANHKGHEFVAV- - IGEEEF 407 

Query: 283 RFSSVALKN 291 

+ +++LKN 
Sbjct: 408 KSETLSLKN 416 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2535 

A DNA sequence (GASx564) was identified in S.pyogenes <SEQ ID 7569> which encodes the amino acid 
sequence <SEQ ID 7570>. Analysis of this protein sequence reveals the following: 

Possible site: 56 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1264 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2536 

A DNA sequence (GASx576) was identified in S.pyogenes <SEQ ID 7571> which encodes the amino acid 
sequence <SEQ ID 7572>. Analysis of this protein sequence reveals the following: 

Possible site: 60 

>>> Seems to have a cleavable N-term signal seq. 

Final Results 

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2537 

A DNA sequence (GASx577R) was identified in S.pyogenes <SEQ ID 7573> which encodes the amino acid 
sequence <SEQ ID 7574>. Analysis of this protein sequence reveals the following: 

Possible site: 17 

>>> Seems to have no N-terminal signal sequence 

INTEGRAL Likelihood = -2.60 Transmembrane 2 - 18 ( 1 - 18) 

Final Results 

bacterial membrane --- Certainty=0.2041 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2538 

A DNA sequence (GASx579) was identified in S.pyogenes <SEQ ID 7575> which encodes the amino acid 
sequence <SEQ ID 7576>. Analysis of this protein sequence reveals the following: 

Possible site: 13 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.3161 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB12286 GB:Z99106 similar to hypothetical proteins [Bacillus subtilis] 
Identities = 62/140 (44%) , Positives = 88/140 (62%) , Gaps = 3/140 (2%) 

Query: 3 LTNWQEVSLADFGKPLHHKAYVBSIKRLKTTGGRFFPKDGHLDraPRMLEEHGELIFRKIV 62 

L +++S F KP H+A +N RLKTTGGR+ +++ N + L EHG 1+ 

Sbjct: 6 LQKLTEDISETYFKKPFRHQALFNDRLKTTGGRYLLTSHNIEIjNRKYLIEHGREELIGII 65 

Query: 63 RHELCHYHLYFEGRGYHHKDRDFKDLLAQVNGLRY VPTSSKSKTNHHYSCQTCGQVY 119 

+HELCHYHL+ EG+GY H+DRDF+ LL QVN R+ + +++K + Y C TCGQ Y 
Sbjct: 66 KHELCHYHLHLEGKGYKHRDRDFRMLLQQVNAPRFCTPLKKKAENKKTYMYICTTCGQQY 125 

Query: 120 QRKRRINLAKYVCGNCHGKL 139 

+KR +N +Y CG C GK+ 
Sbjct: 126 IKKRAMNPDRYRCGKCRGKI 145 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2539 

A DNA sequence (GASx587R) was identified in S.pyogenes <SEQ ID 7577> which encodes the amino acid 
sequence <SEQ ID 7578>. Analysis of this protein sequence reveals the following: 

Possible site: 53 

>>> Seems to have no N-terminal signal sequence 

INTEGRAL Likelihood =-10.40 Transmembrane 46 - 62 ( 39 - 89) 
INTEGRAL Likelihood = -5.36 Transmembrane 65 - 81 ( 63 - 89) 

Final Results 

bacterial membrane --- Certainty=0.5161 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
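
The INTEGRAL lines in these reports give a likelihood score and the residue range of each predicted transmembrane segment. The following is a minimal sketch and not part of the original analysis. It assumes lines of the form shown for Example 2539; the name `parse_integral` is hypothetical, and the second pair of numbers (in parentheses) is returned as printed without interpreting it.

```python
import re

# "INTEGRAL Likelihood = <score> Transmembrane <start> - <end> ( <a> - <b>)"
INTEGRAL_RE = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?\d+\.\d+)\s+Transmembrane\s+"
    r"(\d+)\s*-\s*(\d+)\s*\(\s*(\d+)\s*-\s*(\d+)\s*\)"
)


def parse_integral(text):
    """Return a list of (likelihood, start, end, range_start, range_end)."""
    return [
        (float(m.group(1)),) + tuple(int(m.group(i)) for i in range(2, 6))
        for m in INTEGRAL_RE.finditer(text)
    ]


# INTEGRAL lines of Example 2539.
sample = """
INTEGRAL Likelihood =-10.40 Transmembrane 46 - 62 ( 39 - 89)
INTEGRAL Likelihood = -5.36 Transmembrane 65 - 81 ( 63 - 89)
"""
segments = parse_integral(sample)
for likelihood, start, end, _, _ in segments:
    # Segment length in residues, counting both end points.
    print(likelihood, start, end, end - start + 1)
```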

Example 2540 

A DNA sequence (GASx590R) was identified in S.pyogenes <SEQ ID 7579> which encodes the amino acid 
sequence <SEQ ID 7580>. Analysis of this protein sequence reveals the following: 

Possible site: 35 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.2036 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2541 

A DNA sequence (GASx592R) was identified in S.pyogenes <SEQ ID 7581> which encodes the amino acid 
sequence <SEQ ID 7582>. Analysis of this protein sequence reveals the following: 

Possible site: 23 

>>> Seems to have a cleavable N-term signal seq. 

INTEGRAL Likelihood = -4.62 Transmembrane 25 - 41 ( 24 - 43) 

Final Results 

bacterial membrane --- Certainty=0.2848 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2542 

A DNA sequence (GASx600) was identified in S.pyogenes <SEQ ID 7583> which encodes the amino acid 
sequence <SEQ ID 7584>. Analysis of this protein sequence reveals the following: 

Possible site: 24 

>>> Seems to have an uncleavable N-term signal seq 

INTEGRAL Likelihood = -2.18 Transmembrane 3 - 19 ( 2 - 19) 

Final Results 

bacterial membrane --- Certainty=0.1871 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2543 

A DNA sequence (GASx603R) was identified in S.pyogenes <SEQ ID 7585> which encodes the amino acid 
sequence <SEQ ID 7586>. Analysis of this protein sequence reveals the following: 

Possible site: 48 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.3027 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 VVLVVNTATKCGLTPQYOJ^QALYDTYHDKGFEVLDFPCKQFLNQAPGDAEEIHHFCSLT 60 

W+WNTA+KCG TPQ++ L+ LY+TY D+G E+L FPCNQF NQ G+ EIN FC h 
Sbjct: 25 WIVVNTASKCGFTPQFEGLEICLYETYKDQGLEILGFPCNQFANQDAGENTEIHEFCQLN 84 

Query: 61 YHTTFPRFAKIKVNGKDADPLFTWLKEEKSGPLGKRIEVJNFTKFLIDQNGQVIKRYSSKT 120 

Y TF F KIKVNGK+A PL+ +LK+E G h I+WHFTKFMD++GQVI+R++ KT 
Sbjct: 85 YGVTFTMFQKIKVNGKEAHPLYQFLKKEAKGALSGTIKWNFTKFLIDRDGQVIERFAPKT 144 

Query: 121 DPKLIEEDLKALL 133 

+P+ +EE++K LL 
Sbjct: 145 EPEEMEEEIKKLL 157 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2544 

A DNA sequence (GASx605) was identified in S.pyogenes <SEQ ID 7587> which encodes the amino acid 
sequence <SEQ ID 7588>. Analysis of this protein sequence reveals the following: 

Possible site: 26 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.3687 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2545 

A DNA sequence (GASx608R) was identified in S.pyogenes <SEQ ID 7589> which encodes the amino acid 
sequence <SEQ ID 7590>. Analysis of this protein sequence reveals the following: 

Possible site: 17 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1327 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2546 

A DNA sequence (GASx616) was identified in S.pyogenes <SEQ ID 7591> which encodes the amino acid 
sequence <SEQ ID 7592>. Analysis of this protein sequence reveals the following: 

Possible site: 21 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2547 

A DNA sequence (GASx617R) was identified in S.pyogenes <SEQ ID 7593> which encodes the amino acid 
sequence <SEQ ID 7594>. Analysis of this protein sequence reveals the following: 

Possible site: 36 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.0677 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2548 

A DNA sequence (GASx622R) was identified in S.pyogenes <SEQ ID 7595> which encodes the amino acid 
sequence <SEQ ID 7596>. Analysis of this protein sequence reveals the following: 

Possible site: 16 

>>> Seems to have an uncleavable N-term signal seq 

INTEGRAL Likelihood = -7.32 Transmembrane 4 - 20 ( 1 - 25) 

Final Results 

bacterial membrane --- Certainty=0.3930 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2549 

A DNA sequence (GASx632) was identified in S.pyogenes <SEQ ID 7597> which encodes the amino acid 
sequence <SEQ ID 7598>. Analysis of this protein sequence reveals the following: 

Possible site: 31 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -3.40 Transmembrane 83 - 99 ( 82 - 102)
INTEGRAL Likelihood = -1.28 Transmembrane 108 - 124 ( 108 - 124)

Final Results 

bacterial membrane --- Certainty=0.2359 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2550 

A DNA sequence (GASx638) was identified in S.pyogenes <SEQ ID 7599> which encodes the amino acid 
sequence <SEQ ID 7600>. Analysis of this protein sequence reveals the following: 

Possible site: 25 

>>> Seems to have an uncleavable N-term signal seq

28 ( 12 - 28) 



Final Results 

bacterial membrane --- Certainty=0.1256 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2551 

A DNA sequence (GASx652R) was identified in S.pyogenes <SEQ ID 7601> which encodes the amino acid 
sequence <SEQ ID 7602>. Analysis of this protein sequence reveals the following: 

Possible site: 16 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.2622 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAA74610 GB:Y14232 hypothetical protein [Bacteriophage TP901-1] 
Identities = 225/485 (46%) , Positives = 308/485 (63%) , Gaps = 20/485 (4%) 

Query: 2 RJCVAIYSRVSTINQAEEGYSIQGQIEALTKyCEnMEWKIYKNVSDAGFSGGKLERPAITE 61 

+KVAIY+RVST NQAEEG+SI QI+ LTKY EAM W++ Y+DAGFSG KLERPA+ 
Sbjct: 3 KKVAIYTRVSTTNQAEEGFSIDEQIDRLTKYAEAKGWQVSDTYTDAGFSGAKLERPAMQR 62 

Query: 62 LIEDGKl^KFDTILVYKLDRLSRNVXDTLYLVKDVFTANNIHFVSLKENIDTSSAMGNLF 121 

LI D +N FDT+LVYKLDRLSR+V+DTLYLVKDVFT N I F+SL E+IDTSSAMG+LF 
Sbjct: 63 LINDIENKAFDTVLVYKLDRLSRSVRDTLYLVKDVFTKNKIDFISLNESIDTSSAMGSLF 122 

Query: 122 LTLLSAIAEFEREQIKERMQFGVMNRAKSGKTTAWKTPPYGYRYNKDEKTLSVNELEAAN 181 

LT+LSAI EFERE IKERM G + RAKSGK+ W +GY +N+ L + L+A 
Sbjct: 123 LTILSAINEFERENIKERMTMGKLGRAKSGKSI^MWTKTAFGYYHNRKTGILEIVPLQATI 182 

Query: 182 WQMFDMIISGCSIMSIWARDN-FVGN--TWTHVKVKRILENETYKGLVKYREQTFSG 238 

V Q+F +SG S+ + + ++ +G W++ +++ L+N Y G +K+++ F G 
Sbjct: 183 VEQIFTDYLSGISLTKLRDKLNESGHIGKDIPWSYRTLRQTLDNPVYCGYIKFKDSLFEG 242 

Query: 239 DHQAIIDEKTYNKAQIALAHRT DTKTNTRPFQGKYMLSHIAKCGYCGAPLKVCTGR 294 

H+ II +TY K Q L R + N RPFQ KYMLS +A+CGYCGAPLK+ G 

Sbjct: 243 MHKPIIPYETYLKVQKELEERQQQTYERNNNPRPFQAKYMLSGMARCGYCGAPLKIV1GH 302 



Query: 295 AKNDGIWQTWCVNKTESLAHRSVNNYNNQKICNTGRYEKKHIEKYVIDVLYKLQHDKE 354 

+ DG+R Y C N+ + + YN+ K C++G Y+ ++E VID L Q + + 

Sbjct: 303 KRKDGSRTMKYHCANRFPR-KTKGITVYNDNKKCDSGTYDLSNLENTVIDNLIGFQENND 361 

Query: 355 YLKKIKKDDN- - IIDITPLKKEIEIIDKKINRLNDLYINDLIDLPKLKKDIEELNHLKDD 412 

L KI +N I+D + KK+I IDKKI + +DLY+ND I + +LK + L K 
Sbjct: 362 SLLKIINGNNQPILDTSSFKKQISQIDKKIQKNSDLYliNDFITMDELKDRTDSLQAEK-- 419 

Query: 413 YNKAIKLNYLDKKNEDSLGML MDNLD I RKS S YDVQSRI VKQL I DRVE VTMDNID 456 

K +K + K DS + + ++ I + SYD + +IV L+ +V+VT DN+D 

Sbjct: 420 --KLLKAKISENKFTTOSTDVFELVKTQLGSIPIIffiLSYDNKKKIVNNLVSKVDVTADNVD 477 

Query: 467 IIFKF 471 

IIFKF 
Sbjct: 478 IIFKF 482 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
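
Each homology hit in these examples is summarised by a line of the form "Identities = a/n (p%), Positives = b/n (q%), Gaps = c/n (r%)". The sketch below, given purely as an illustration, shows one way such a line could be split into its counts; the helper name and field handling are assumptions and do not correspond to any particular BLAST library.

```python
import re

# Hypothetical helper for illustration; the field names follow the summary
# lines quoted in this document.
FIELD_RE = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)"
)


def parse_alignment_summary(line):
    """Map each field name to (count, alignment_length, reported_percent)."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in FIELD_RE.findall(line)
    }


# Summary line of the Bacteriophage TP901-1 hit in Example 2551.
summary = "Identities = 225/485 (46%) , Positives = 308/485 (63%) , Gaps = 20/485 (4%)"
fields = parse_alignment_summary(summary)
count, length, reported = fields["Identities"]
print(f"identity fraction: {count / length:.3f} (reported as {reported}%)")
```

Summary lines without a "Gaps" field, as in some of the shorter alignments, simply yield a mapping with two entries.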

Example 2552 

A DNA sequence (GASx653R) was identified in S.pyogenes <SEQ ID 7603> which encodes the amino acid 
sequence <SEQ ID 7604>. Analysis of this protein sequence reveals the following: 

Possible site: 48 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -1.22 Transmembrane 86 - 102 ( 86 - 102) 

Final Results 

bacterial membrane --- Certainty=0.1489 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAF12707 GB:AF066865 unknown [bacteriophage TPW22] 
Identities = 45/67 (67%), Positives = 53/67 (78%), Gaps = 2/67 (2%) 

Query: 57 EKEATOCPKCKSTOTGFMQQGKKTFSWKAVaGTLLIG--GTVMGFLGEKGKKQWHCNEC 114

+K A++CPKCKST+V FMQQGKK FSV KAV G +L G GT+ GF G+KGKKQWHCN C 
Sbjct: 138 DKHAIKCPKCKSTDWFMQQGKKGFSVGKAVGGAVLTGGIGTLAGFAGKKGKKQWHCHNC 197

Query: 115 SCIFETK 121 
+FETK

Sbjct: 198 GRVFETK 204 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2553

A DNA sequence (GASx655) was identified in S.pyogenes <SEQ ID 7605> which encodes the amino acid
sequence <SEQ ID 7606>. Analysis of this protein sequence reveals the following:

Possible site: 50 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.3956 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB63661 GB:AJ251789 Cro protein [Lactobacillus casei 
bacteriophage A2]

Identities = 43/76 (56%) , Positives = 55/76 (71%) 

Query: 26 MTINLKRLKAERIASGMTQCEVAQSMGWKTRTPYAKRENGIVSIGADELAKITLIFGLPI 85
MT4NLKRL+AERIA GM Q E+A++MGW TR+ YAKRENGI +1 A EL K+ I G 
Sbjct: 1 MTIJ^LKRLRAERIAKGMNQDEMAKAMGWHTRSSYAKRENGITTISATELVKMASILGYGT 60

Query: 86 EKIAIFFDKDVPVMER 101 

++ +FF +VP ER 
Sbjct: 61 NQLDLFFTNNVPDRER 76 


Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2554 

A DNA sequence (GASx656) was identified in S.pyogenes <SEQ ID 7607> which encodes the amino acid 
sequence <SEQ ID 7608>. Analysis of this protein sequence reveals the following:

Possible site: 34 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4505 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2555

A DNA sequence (GASx657) was identified in S.pyogenes <SEQ ID 7609> which encodes the amino acid 
sequence <SEQ ID 7610>. Analysis of this protein sequence reveals the following: 

Possible site: 35 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.6593 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2556 

A DNA sequence (GASx658) was identified in S.pyogenes <SEQ ID 7611> which encodes the amino acid
sequence <SEQ ID 7612>. Analysis of this protein sequence reveals the following: 

Possible site: 32 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.5244 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2557 

A DNA sequence (GASx660) was identified in S.pyogenes <SEQ ID 7613> which encodes the amino acid 
sequence <SEQ ID 7614>. Analysis of this protein sequence reveals the following: 

Possible site: 58

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.1133 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAB99331 GB:U67572 purine NTPase [Methanococcus jannaschii] 
Identities = 71/346 (20%) , Positives = 154/346 (43%) , Gaps = 52/346 (15%) 

Query: 8 MSITINKLEIENVK RIKAVKIEPSATGLTIIGGNNNQGKTSVLDAIAWAL--GGN 60 

MS+ + ++ + N K RIK K G+ I G N GK+S+ +A+ +AL G+ 

Sbjct: 1 MSMILKEIRMNNFKSHVNSR1KFEK GIVAI IGENGSGKS S I FEAVFFALFGAGS 54 

Query: 61 KYKPSQAMREGSQ VPPTLKITMSNGLIVERKGKNASIiKVIDPNGQ KG 107 

+ + +6 + V ++ +N 1+ + NG+ K 

Sbjct: 55 NFNYDTI ITKGKKSVYVELDFEVM3NFYKI IRE YDSGRGGAKLYKNGKPYATTISAVNKA 114 

Query: 108 GQQLL DSFVEELAI---NLPKFMDSTPKEKADVLI,EIIGVGDQLAELEIjKEKEIYN 160 

++L + F+ + I + KF+ P EK + + +++G+ D+ + K EI 
Sbjct: 115 TOffilLGVDRlMFMSIYIKQGEIAKFLSIjKPSEKLETVAKLLGI-DEFEKCYQKMGEIVK 173 

Query: 161 QRHAIGVIADQKEKFAKEMTYYPDAPKQLVS-ISELIQQHQAILAKNGE-NAQKR--QNV 216 

+ + E+ E+ Y + K+L + +S+L ++++ ++ N + N K+ +++ 

Sbjct: 174 E YEKRLERIEGELNYKEI^EKELKNKMSQLEEKNKKLMEIiroKMKIKKEFEDI 227 

Query: 217 ERIRYDYNQSILEVDRLRKLLADAEAKTNKLSEDLKIANTD AMDLHDESTAE I E 270 

E++ ++ L ++ L + + +++LKI D A + + EE 

Sbjct: 228 EKLFNEliffiNKKLLYEKFINKLEERKRALELKNQELKIL^ 287 

Query: 271 ANIADIDEVNRKVRANFDKDKAE-EDAKQQREQYNILTNDIESIRQ 315 

+ +DE+ RK+ + + K+ ED + +Q 1+ DIE +++ 
Sbjct: 288 KYKSLVDEI-RKIESRLRELKSHYEDYLKLTKQLEIIKGDIEKLKE 332 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2558 

A DNA sequence (GASx661) was identified in S.pyogenes <SEQ ID 7615> which encodes the amino acid 
sequence <SEQ ID 7616>. Analysis of this protein sequence reveals the following:

Possible site: 28 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.1559 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2559 

A DNA sequence (GASx662) was identified in S.pyogenes <SEQ ID 7617> which encodes the amino acid 
sequence <SEQ ID 7618>. Analysis of this protein sequence reveals the following: 

Possible site: 52

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.3292 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2560 

A DNA sequence (GASx663) was identified in S.pyogenes <SEQ ID 7619> which encodes the amino acid 
sequence <SEQ ID 7620>. Analysis of this protein sequence reveals the following:

Possible site: 15 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4867 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2561 

A DNA sequence (GASx664) was identified in S.pyogenes <SEQ ID 7621> which encodes the amino acid
sequence <SEQ ID 7622>. Analysis of this protein sequence reveals the following: 

Possible site: 46 

>>> Seems to have no N-terminal signal sequence 


Final Results 

bacterial cytoplasm --- Certainty=0.2141 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2562

A DNA sequence (GASx667) was identified in S.pyogenes <SEQ ID 7623> which encodes the amino acid 
sequence <SEQ ID 7624>. Analysis of this protein sequence reveals the following: 

Possible site: 59 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.2614 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAF80834 GB:AF165214 Orf78 [Pseudomonas phage D3] 
Identities = 68/200 (34%) , Positives = 109/200 (54%) , Gaps = 10/200 (5%)

Sbjct:
71
Sbjct:
63
Sbjct:
120
Query:
Sbjct:
f W C CDC3N+ ++ G+ +R+ T £
r T Y W M++RC N N Y Y G+GI++C+ W+TF NF

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2563

A DNA sequence (GASx668) was identified in S.pyogenes <SEQ ID 7625> which encodes the amino acid 
sequence <SEQ ID 7626>. Analysis of this protein sequence reveals the following: 

Possible site: 41 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.1476 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB75598 GB:AJ271879 putative DNA helicase [uncultured 
eubacterium] 

Identities = 42/168 (25%) , Positives = 75/168 (44%) , Gaps = 7/168 (4%) 

Query: 374 IAGPSKAGKSFALIELSIAIAEGQKWLG-WCjCEQGKvTJYVm^ELDRPSALHRFIOlvYDAM 432 

+ P AGKS ++L+ +A G LG + G V+Y+ E D P+A+H A 
Sbjct: 35 LVSPGGAGKSMLALQIAAQIAGGPDLLGVGELPTGPVIYLPAE-DPPTAIHHRLHALGAH 93 



WO 02/34771 



-2700- 



PCT/GB01/04789 



Query: 433 GLPPAWANIDIWNLRGKTVPMDKIAPKLIRRSLKKNYQA- - - VI IDPIYKVLTGDENSA 489 

A D ++ + + +LK+ + +I+D + + +EN++ 

Sbjct: 94 LSAEERQAVADC3LI.IQPLIGSLPNIMASNWPEALKRAAEGRRLMILDTLRRFHIEEENAS 153 

Query: 490 DQMAHFTNQFDKVATELGCSVIYCHHHSKGS- -QGGKKSMDRASGSGV 535 

MA + + +A + GCS+++ HH SKG+ G + GS V 

Sbjct: 154 GPMAQVIGRMEAIAADTGCSIVFLHHASKGATMMGAGDQQQASRGSSV 201 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2564 

A DNA sequence (GASx669) was identified in S.pyogenes <SEQ ID 7627> which encodes the amino acid 
sequence <SEQ ID 7628>. Analysis of this protein sequence reveals the following: 

Possible site: 56 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.2555 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2565 

A DNA sequence (GASx670) was identified in S.pyogenes <SEQ ID 7629> which encodes the amino acid 
sequence <SEQ ID 7630>. Analysis of this protein sequence reveals the following:

Possible site: 54

>>> Seems to have no N-terminal signal sequence



Final Results 

bacterial cytoplasm --- Certainty=0.2921 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAF74082 GB:AF212845 ORF129 [Lactococcus lactis bacteriophage
ul36] 

Identities = 36/108 (33%) , Positives = 63/108 (58%) , Gaps = 1/108 (0%) 

Query: 8 IEFFLPMDKIPTTTHQQKKVTVINGKPHFyEPSSLKNARDKFTSLLAQHVPPSKLDGPIR 67 

++F +DK+PTT QQK + + GK FY+ KN K + + + + P++ 

Sbjct: 1 MKFEFELDKMPTT-QQQKGIKKTOGKLQFYDRRGTIO^YSLKAQIjMKNKPKECFEKNVPLK 59 

Query: 68 LOTKWLFPKIKGSTNGQYKTrKPDTDNLQKLLKDCMTELGFWNDDAQV 115 

L+V + + + Q+KT++PD DNL K L+D MT+L +++DD+Q+ 

Sbjct: 60 LSVTFFYAIKQKKRWWQWKTSRPDLDNLMKNLQDYMTKLRYYSDDSQI 107

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2566 

A DNA sequence (GASx671) was identified in S.pyogenes <SEQ ID 7631> which encodes the amino acid 
sequence <SEQ ID 7632>. Analysis of this protein sequence reveals the following: 

Possible site: 33 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4294 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2567 

A DNA sequence (GASx672R) was identified in S.pyogenes <SEQ ID 7633> which encodes the amino acid 
sequence <SEQ ID 7634>. Analysis of this protein sequence reveals the following: 

Possible site: 15 

i cleavable N-term signal seq. 

Transmembrane 106 - 122 ( 104 - 125)

Final Results 

bacterial membrane --- Certainty=0.3548 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2568 

A DNA sequence (GASx673) was identified in S.pyogenes <SEQ ID 7635> which encodes the amino acid
sequence <SEQ ID 7636>. Analysis of this protein sequence reveals the following: 

Possible site: 56 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.4781 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 




The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAB18697 GB:U38906 ORF22 [Bacteriophage rlt]
Identities = 78/207 (37%), Positives = 123/207 (58%), Gaps = 2/207 (0%) 



Query:
Sbjct:
7
88
Sbjct:
66
147
Sbjct:
126
Query:
207
Sbjct:
186
ELSD+A+ FL+ +IRG+NGW+HGD+L VK +YF+QN+ ++ + FS+INV+P ++



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2569 

A DNA sequence (GASx674) was identified in S.pyogenes <SEQ ID 7637> which encodes the amino acid
sequence <SEQ ID 7638>. Analysis of this protein sequence reveals the following: 

Possible site: 51 

>>> Seems to have no N-terminal signal sequence

122 - 138 ( 122 - 138) 



Final Results 

bacterial membrane --- Certainty=0.1001 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAF63071 GB:AF158600 gp137 [Streptococcus thermophilus
bacteriophage Sfi11]
Identities = 66/135 (48%), Positives = 89/135 (65%), Gaps = 2/135 (1%)

Query: 5 PEIDIQKTKSNAKRKLREYPRWRRIANDVDTQKVTATYSFEPRQSHGVPSKPVERLALNR 64
PEID + T KRKLREYPRWR IA+D QK+T ++F PR G +KPVE +A+ R 
Sbjct: 4 PEIDEKATLKRCKRKLREYPRWREIAHDSAEQKITQEFTFMPRG--GGVNKPVENIAVRR 61

Query: 65 VSAEQELDAIEQAVSMILEPERRRILYDKYIiAPYKKADKVIYTELCMSESFYYDTLDIAL 124 

V A EL+AIEQAV+ + P4 RRIL +KYLA K+I + + + + L+++ 
Sbjct: 62 TOALNELEAIEQAVNGLYRPDYRRILIEKYLAYPPKPNWQIAQSIGFERTAFQELLNNSI 121 

Query: 125 LAFAELYREGVLLVE 139 

LAFAELYR+G L+VE 
Sbjct: 122 LAFAELYRDGRLIVE 136



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2570

A DNA sequence (GASx675) was identified in S.pyogenes <SEQ ID 7639> which encodes the amino acid 
sequence <SEQ ID 7640>. Analysis of this protein sequence reveals the following:

Possible site: 41

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1865 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2571 

A DNA sequence (GASx676) was identified in S.pyogenes <SEQ ID 7641> which encodes the amino acid
sequence <SEQ ID 7642>. Analysis of this protein sequence reveals the following:

Possible site: 31

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.4870 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:BAB07254 GB:AP001519 unknown [Bacillus halodurans] 
Identities = 194/451 (43%) , Positives = 262/451 (58%) , Gaps = 69/451 (15%)

Sbjct:
IRIVNKKIDDLVPAEYNPRLDLQPGDPEYEKLKRSIEEFGLVEPIVFNERTGRWGGHQR

Query: 56 YKAAQKLGLETVPVIVADDLSEEQIKAFRLADNKV-GEIAVVroijDLIjNEELNDILDIjDMS 114
K ++LG E VPV V D L + KA +A NK+ G+ + L L EEL+ L +D++
Sbjct: 63 LKILRELGWEEVPVSVVD-LDDHHEKALIWALNKIEGDWDNFKLKELLEELDSGL-IDVT 120

Query: 115 AFGFDVLDNLDDL IEDEKDL- -DDF TGTVPDEPKSKLGDIYQLGSHKLMCG 163

GFD + ++DL 4EDE ++ DDF +EP +K GD+4 LG H L+ G 

Sbjct: 121 LTGFDE-EEIEDLMTQFFVEDENEIKEDDFDPDEVAEEIEEPITKPGDLWHLGRHFLLVG 179 

Query: 164 DSTNGADVKKLMNGEIADLLLTDPPYNVAYEGKTKDSLTIKNDS^NDSFRQFLVNAFSS 223 

DST DVK+LM E AD++ TDPPYNV YEG T + IKND+M++ F QFL +AF + 
Sbjct: 180 DSTKIEDVKRLMGNEKADM1FTDPPYNVDYEGAT--GMKIKNDNMEDSEFYQFLFDAFVA 237 

Query: 224 ANEVMKPGAVFYIWHADSEGYNFRGACETJIGWTWQCLIWNKNSMVLGRQDYHWKHEPCL 283 

+V K G Y+ HADSEG FR A D G+ ++QCLIW KNS+VLGRQDYHW+HEP L 
Sbjct: 238 MYQVTKEGGPIWCHADSEGLTFRKAFQDSGFLLKQCLIWVKNSLVLGRQDYHWRHEPIL 297 



Query: 284 YGWKDGAGHLWASDRKQTSVID -- 305 

YGWK GA H W RKQ++VI+

Sbjct: 298 YGWKPGA&HKWYGGRKQSTVIEDPVDIAITPKVDHVLLTFNNGISSTWKVPSYEIIHDG 357

Query: 306 YEKPQRNGVHPTMKPVGLFDYQIKNNTKGSDIVLDLFGGSGTTLIACESNG 356

E+P+RN HPTMKP+ L I+N++K + VLD FGGSG+TLIACE G 
Sbjct: 358 SDEGMTTWRIERPKRNADHPTMKPIALCARAIQNSSKPGERVLDPFGGSGSTLIACEQTG 417 

Query: 357 RHARLMEYDPKYVDVIIKRWEELTGESVIQL 387 

R +MEYDP Y +VII+RWEE TG++ ++L 
Sbjct: 418 R1CHMMEYDPVYAEVIIRRWEEWTGQNAVKL 448 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2572 

A DNA sequence (GASx677) was identified in S.pyogenes <SEQ ID 7643> which encodes the amino acid 
sequence <SEQ ID 7644>. Analysis of this protein sequence reveals the following: 

Possible site: 54 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.4744 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2573 

A DNA sequence (GASx678) was identified in S.pyogenes <SEQ ID 7645> which encodes the amino acid 
sequence <SEQ ID 7646>. Analysis of this protein sequence reveals the following:

Possible site: 31

>>> Seems to have no N-terminal signal sequence

Transmembrane 90 - 106 ( 90 - 106)

Final Results 

bacterial membrane --- Certainty=0.1107 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2574 

A DNA sequence (GASx679) was identified in S.pyogenes <SEQ ID 7647> which encodes the amino acid 
sequence <SEQ ID 7648>. Analysis of this protein sequence reveals the following: 

Possible site: 19 






>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3408 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAA66734 GB:X98106 minor capsid protein [Bacteriophage phig1e]
Identities = 213/494 (43%) , Positives = 323/494 (65%) , Gaps = 19/494 (3%) 



Query: 1 MGVIQKIKNLVTRSKYVM-TTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYL 59
MG+IQ+IK+L + T SL+ ITD P+I+I EY RI T+L YY + Y
Sbjct: 1 MGLIQRIKDLFWKGAAA.TGVTGSLSKITDDPRISIDPDEYWIQTDLDYYSDKLQYIHYQ 60

Query: 119 ERYLESCLALGGLAMRPYVDGDKVRVAFVQAPVFLPLQSNTQDVSSAAWIKSVKTINGK 178
E LE +ALGG AMRPY+DG+ +++A+V+A F PLQSNT D+S AA+ ++ -fT + +
Sbjct: 121 EFJU^EKGVALG<5FAMRPYIDGNHIKIAWVRADQFYPLQSNTNDISEAAIASRTQRTESNQ 180

Query: 179 EVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLS--EVYKDLKDEAKVTDVTRP 236
YYTL+EFH+WQ + Y I+NELY+SD VG++VPLS VYK+L + ++ + RP
Sbjct: 181 TKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQvTISGLQRP 240

Query: 237 IFTYLKTPGMNNroiNSPLGLSIFDNAKTriDFINTTYDEFMTOVKMGQRRVAVPESLTA 296
+F Y KTPG NN +1 SPLGL + DNAK +D IN T+D+F+WE+++GQ+ +AV +
Sbjct: 241 LFAYFKTPGANNINIESPLGLGWDNAKHVLDDINDTHDQFIWE1RLGQKHIAVQEGMLR 300

Query: 357 MQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVTLVEQSLKELVISIFEIAKAYD 416
+QIG+S G FS+ +KTATE+VS NS TYQ R+S +T+VE+++ EL SIFE+A A
Sbjct: 354 VQIGLSTGTFSYSKDGVKTATEWSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGA 413

Query: 417 LYQSEVP--SMDNISISL DDGVFTDRDAELDYWI KWNAGFGTREMAIQKVLNV 468
L+ P ++D+ S L DDGVF ++D +L+ KV+ G +++ +Q+ +
Sbjct: 414 LFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSKQTFLQRNYGM 473

T+E+A E A+I -I



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 



Example 2575 

A DNA sequence (GASx680) was identified in S.pyogenes <SEQ ID 7649> which encodes the amino acid 
sequence <SEQ ID 7650>. Analysis of this protein sequence reveals the following: 



Possible site: 48

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1840 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB53790 GB:AJ242593 gp4 [Bacteriophage A118]

Identities = 114/385 (29%) , Positives = 187/385 (47%) , Gaps = 23/385 (5%) 

Query: 8 l^EQLLLEASQLSDMYHQLTLDLFDQVIERI 67 
L QL L + D+Y L +LF ++ R+K + + S ADN WQ KL4- V L+ 
Sbjct: 3 LTPRQLDLFVQPIVDWTGLENELFTLIVRRLKTKKNIS-ADNVLAWQIEKLNQVIIALDQ 61

Query: 68 DNIKLIAKYSGIAEAQLRYIIKNEGFKiyKNTSEQLEEALGRESGV NSTIQDD 120

1+ I+K SG++ +L ++K+ G+ K + E+G TI D 

Sbjct: 62 QMIERISKASGVSAKKLFSWKDAGYSDLKQVDNYFSKLA.- - EAGAVLPLVSDGQTIVDK 119 


Query: 121 LSNYARQAIDDVHNLTOTTLPFSVIGAYCGIIQDAVAGVVTGLKTPDQAINQTVIKWFKK 180 

+ + + + N T+ Y II + V+ GLKT QA+ +TV K+ + 

Sbjct: 120 VMRSYFKIAESI^KRINQTMLSQARQIYSDIIKETTQSVIAGLKTHRQALAETVTKFAEN 179 

Query: 181 GFYGFTDKAGRICWRADSYARTVINTTTWRVFNEAICEAPAREFGIDTFYYSKKATAREMCA 240

G DKA ++W ++Y RTV TT V+N ++ E+G+D S+ AR C+ 

Sbjct: 180 GVPALVDKANKRWTPFAYWTVTRTTVNSVYNSVEDERMNEYGVDLVRISQHVGARPTCS 239 

Query: 241 PLQHQIV TTGEAREEGGIKILALSD YGHGEPDGCLGINCKHTKTPFWGVNSK 293 

+Q +++ + E R + G K +++ YG+G DG G NC+H + F+ G+N

Sbjct: 240 IVQGKVICLLSVEETRSKYGNKYMSIYSPELRYGYG--DGIFGCNCRHHRFAFIEGINIA 297 

Query: 294 PEIiPEHLKNITPAQAKANRNAQAKQRAIERSIRKSKELLHVAKQLGDKEIjIRQYQSDVRS 353 
P+ E I + K +QR +ER IR +K L A++LGD+ +++ + W+ 

Sbjct: 298 PDESE LIDEEENKRVYALSQQQRLMERDIRAAKRKLSAAEELGDELAVKKAKQAVRT 354

Query: 354 KQDALNYLINNNAFLHRNQAREKRY 378 

KQ L + + L R +REK Y 
Sbjct: 355 KQSKLRAFVKTHN-LTRQYSREKVY 378 


Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2576 

A DNA sequence (GASx681) was identified in S.pyogenes <SEQ ID 7651> which encodes the amino acid
sequence <SEQ ID 7652>. Analysis of this protein sequence reveals the following:

Possible site: 31 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2756 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2577

A DNA sequence (GASx682) was identified in S.pyogenes <SEQ ID 7653> which encodes the amino acid 
sequence <SEQ ID 7654>: 

TLDNQSVIKAIGDTVDYIKKMYKRKWGK 


Analysis of this protein sequence reveals the following: 

Possible site: 25 

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2578

A DNA sequence (GASx683) was identified in S.pyogenes <SEQ ID 7655> which encodes the amino acid 

sequence <SEQ ID 7656>. Analysis of this protein sequence reveals the following: 

Possible site: 60 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.5288 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2579 

A DNA sequence (GASx685) was identified in S.pyogenes <SEQ ID 7657> which encodes the amino acid 
sequence <SEQ ID 7658>: 

GATEVGANRWSGOTGEVLGVQIWSRKCPKGTAV^ 
K

Analysis of this protein sequence reveals the following: 

Possible site: 18 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1750 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAA59185 GB:X84706 major head protein [Bacteriophage Bl] 
Identities = 138/270 (51%), Positives = 186/270 (68%), Gaps = 6/270 (2%)

Query: 1 ^VGTTKI^QMLDPEVIiADMIDAEVGKAIRFAPL?Uj!VDTTIjEGQPGTTLTVPK-WDYIGD 59

M+ T +A +++PEVLA ++ E+ KA+RFAPIiA+VDTTL+GQPG TL P + YIGD 
Sbjct: 1 MSKQKTTI^LvNPEVrATIVSYELMKALRFAPLRQvDTTLQGQPGNTLKFPDPFTYIGD 60 

Query: 60 AEDVAEGEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHK 119 

A DVAEG 1 + ++G ++TIKKA KG EITDEA LSGYGDP+G++ KQ+ ++ +K 
Sbjct: 61 AADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 120 

Query: 120 VDADVLDALSKSTQTVEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEW 179 

VD D+L A ++QTV A VDGV ALDIFNDED V+++NP DA+ +R DA + 
Sbjct: 121 VDDDLLSAWTSQWSTKANVDGVQAALDIFOT)EDAQAYVLIVNPKDAAKIRKDANAKN 180 

Query: 180 LGATEVGANRWSGVYGEVLGVQIVRSRKCPKGTAYMVR KGALRIMLKRNTMVETD 235 

+G +EVGAN +++G Y +VLG QIVRS+K +G+A M + AL+++LKR VETD 

Sbjct: 181 IG-SEVGANAlINGTYADVIiGAQIWSKKIAEGSALMFKIVSNSPALKLVLKRGVQvETD 239 

Query: 236 RDITKAINQIVANKHYGVYLYKAEKAVKIT 265 

RDI I A++HY YLY K V IT 

Sbjct: 240 RDIVTKTTVITADEHYAAYLYDLTKWNIT 269 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2580 

A DNA sequence (GASx686) was identified in S.pyogenes <SEQ ID 7659> which encodes the amino acid 
sequence <SEQ ID 7660>. Analysis of this protein sequence reveals the following: 

Possible site: 35 

>>> Seems to have an uncleavable N-term signal seq 



Final Results 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2581 

A DNA sequence (GASx687) was identified in S.pyogenes <SEQ ID 7661> which encodes the amino acid
sequence <SEQ ID 7662>. Analysis of this protein sequence reveals the following: 

Possible site: 54 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2942 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2582 

A DNA sequence (GASx688) was identified in S.pyogenes <SEQ ID 7663> which encodes the amino acid 
sequence <SEQ ID 7664>. Analysis of this protein sequence reveals the following: 

Possible site: 21 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.2844 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC00538 GB:L02496 unknown protein [Bacteriophage LL-H] 
Identities = 35/86 (40%), Positives = 48/86 (55%), Gaps = 6/86 (6%)

Query: 24 KLIMNNQVIWS^PYVPyRDGALRGSSRANSVGVTWSGPHARAQFYGGAYNKYKSFKFKK 83 

+L + NQ+ M YVP R G LR S N G+ ++ +ARAQFYG + + 

Sbjct: 20 RLQVLNQMHQDMEQYVPKRAGFLRSQSFVNDTGIHYTAKYARAQFYGFV NGHRVRN 75 

Query: 84 YTTPGTGKRWDKRALANATIVKDWEK 109 

Y+TPGTG+RWD + A A DW+K 
Sbjct: 76 YSTPGTGRRWDLK- -AKAVYKADWQK 99 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2583 

A DNA sequence (GASx689) was identified in S.pyogenes <SEQ ID 7665> which encodes the amino acid 
sequence <SEQ ID 7666>. Analysis of this protein sequence reveals the following: 

Possible site: 45 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2892 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAA66741 GB:X98106 minor capsid protein [Bacteriophage phig1e]
Identities = 36/109 (33%), Positives = 64/109 (58%), Gaps = 2/109 (1%)

Query: 17 DLGlKPRLDYLTRQEDIAIYPMPGGKVN^YMDGTREISLPFEIAIKTKNQELAST\mWT 76 

+L +K L YLT + L++YP+PG +V +E G ++ + +E+ ++TKNQ+ A+T +W 
Sbjct: 16 NLPMKCTLGYLTJ^SLSLYPLPGSRVLDEDYAGNQQWQMNYEVGMRTKNQQQiOTTLWlj 75 

Query: 77 INSALSNPDL-KLPSENHSYTFISLDVE-KPFIJilDLSDQGFYIYVLDIT 123 

++ AL L S N S+ F SL + +P +++ QG+ Y L + 

Sbjct: 76 VSQALDVLTADDLVSSNGSFEFESLTINGQPSISEQDTQGYSTYQLSFS 124

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2584 

A DNA sequence (GASx690) was identified in S.pyogenes <SEQ ID 7667> which encodes the amino acid 
sequence <SEQ ID 7668>. Analysis of this protein sequence reveals the following: 
Possible site: 18 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1626 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB53798 GB:AJ242593 major tail shaft protein [Bacteriophage A118]

Identities = 54/133 (40%), Positives = 77/133 (57%), Gaps = 9/133 (6%) 

Query: 1 MRQKNALRGHFIAPYVKGEEKTEVTKEKLLELARWIKDISDDTDEKTEDEAYYDGDGTEE 60 

MR KNA + +A V G + + + L++WI ++SDD + TE++ YDGDG E+ 

Sbjct: 1 MRIKNAKTKYSVAEIVAGAGEPDWKR LSKWITNVSDDGSDNTEEQGDYDGDGNEK 55 

Query: 61 TTWGVKGAYTFEGTYDPEDKAQAHIASLKYK1GDERKVWHLIVSADGKTQWLGVATVTE 120 

T V+G AYTFEGT+D ED+AQ I+K++R+ I D+T+G ATV+E 
Sbjct: 56 TVVLGYSEAYTFEGTHDREDEAQNLIVA-KRRTPENRSIMFKIEIPDTETA-IGKATVSE 113 

Query: 121 I--IAGSGARARF 131

I AG G A F 
Sbjct: 114 IKGSAGGGDATEF 126 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2585 

A DNA sequence (GASx691) was identified in S.pyogenes <SEQ ID 7669> which encodes the amino acid 
sequence <SEQ ID 7670>. Analysis of this protein sequence reveals the following: 

Possible site: 17 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.3521 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2586

A DNA sequence (GASx692) was identified in S.pyogenes <SEQ ID 7671> which encodes the amino acid
sequence <SEQ ID 7672>. Analysis of this protein sequence reveals the following: 

Possible site: 61 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3438 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB53801 GB:AJ242593 gp15 [Bacteriophage A118]
Identities = 67/191 (35%), Positives = 110/191 (57%), Gaps = 17/191 (8%)





Query: 11 FEFRGEIYPIDLSENKVLDVFDVIDDDFLNEAEKCFLCLDILLDRTDLPFTYAVD 65
+E+ G+ Y +DL+F+ VL V D+ +D+ L++ + L +D+L D+P+ + +
Sbjct: YE YEGKE YKLDIAFDimiRVIDLTBDNSLSDVPRANLAIDVLF-ADDMPWPRSNEEDE YA 70

Query: 66 LWVYIKTNFIDAERPEKPQLDIKGNPMPWKEKEDNKKVI---DLSLDAEFIY 115
+ + I TN+I E + DI GN MP D+ + I L+ DA++IY
Sbjct: 71 NIEEKSLVLIDIFTNYIVKENDDGLLYDIDGNKMPSATNNNDDAEEIASYSLTQDADYIY 130

Query: 116 ASFRQAYQINLLKEQNRLSWIEFKAIjMAIiPDDTVMQRIIAIRQWE-DDGEGSKKYRDNM 174
ASF Q Y I+LL + ++ W +F+ALL +L DDT ++ II IRQ E G4G++K R+ +
Sbjct: 131 ASFLQDYNIDLUDSRGKMHWYKFRALLESLRDDTTIKTIIGIRQAELPSGKGTEKERNEL 190

Query: 175 RKLKAKYSLDE 185
KLK +Y L +
Sbjct: 191 IKLKNRYKLKD 201



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2587

A DNA sequence (GASx694) was identified in S.pyogenes <SEQ ID 7673> which encodes the amino acid 
sequence <SEQ ID 7674>. Analysis of this protein sequence reveals the following: 

Possible site: 29 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4143 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAG18639 GB:AY007505 unknown [Streptococcus mitis] 
Identities = 48/157 (30%), Positives = 85/157 (53%), Gaps = 10/157 (6%) 

Query: 86 DLELSWEPDYIYKATH1TPFSIKEVLRNFGRLKINFLIHPIKYLKTGKQEVPLVNG-GTL 144 

+LE S+ P+ ++ A H S K + +LKI + P +Y KT E NG GT+ 
Sbjct: 81 ELEFSYHPESVFYA-HFLTASYKPFGWHftWQLKlKLNMQPFRYQKTVNPES--YNGPGTI 137 

Query: 145 QNPGNVQAKPILKIKGTGNGILTINDFETGLSOTCSELVIDMERHLvYKDVLSAWDNIW 204 

NPG + ++PI++++G G+ +TI ET NV+++ ID + +++ +A + 
Sbjct: 138 NNPGTIYSEPIIEVQGDGDVSITIGR-ETMYLNVKTKATIDCRQG--RQNIYNATGAVQN 194 

Query: 205 TERHRMPLFDV--GQNKISWTGS-FTITAVPNWGVKV 238 

T R R F++ G++ I++TG+ + PNW K+ 
Sbjct: 195 TLRKRGGFFEIPTGRSGITFTGNVIjRLIIRPNWRYKI 231 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2588 

A DNA sequence (GASx695R) was identified in S.pyogenes <SEQ ID 7675> which encodes the amino acid 
sequence <SEQ ID 7676>. Analysis of this protein sequence reveals the following: 

Possible site: 15
10 N- terminal signal sequence 

15 - . 31 ( 15>- 31) 

Final Results 

bacterial membrane --- Certainty=0.2041 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2589 

A DNA sequence (GASx697) was identified in S.pyogenes <SEQ ID 7677> which encodes the amino acid 
sequence <SEQ ID 7678>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3348 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 
Query: 1 MSENIPLRVQFKRMKAAEWARSDVILLESEIGFETDTGFAEAGDGHNRFSDLGYISPLDY 60

MSENIPLRVQFKRMKAAEWARSDVILLESEIGFETDTGFARAGDGHNRFSDLGYISPLDY 
Sbjct: 1 MSENIPLRVQFKRMKAAEWARSDVILLESEIGFF/TDTGFARAGDGHNRFSDLGYISPLiDY 60 

Query: 61 NLLTNKPNIDGIATKVETAQKLQQICADKETVYTKAESKQELDKKimKGGVMTGQIiKFKP 120 

NLLTNKPNI03LATKvETAQia^QQKftDKKr^^ 
Sbjct: 61 NLLTNKPNIDGIATKVETAQKLQQKRDI^TOTTKAESKQELDKKLI^KGGVMTGQLKFKP 120 

Query: 121 AATOAYSSSTGGATOIDLSSTRGAGVVVYSDNDTSDGPLMSLRTGKETFNQSALFVDYKG 180 

AATVAYSSSTGGAWIDLSSTRGAGVWYSDNDTSDGPLMSLRTGKETFNQSALFVDYKG 
Sbjct: 121 AATVAYSSSTGGAVNIDLSSTRGAGVWYSDNOTSDGPLMSLRTGKETFNQSALFVDYKG 180 

Query: 181 TTKAVNIAMRQPTTPNFSSALNITSGNENGSAMQLRGSEKALGTLKITHENPSIGADYDK 240 

TTNAVNIAMR TTPNFSSALNITSGNENG3AMQLRGSEKALGTLKITHENPSIGADYDK 
Sbjct: 181 TTNAVNIAMRHATTPNFSSALNITSGNENGSAMC'LRGSEKALGTLKITHENPSIGADYDK 240 

Query: 241 NAAALSIDIVKKTNGAGTAAQGIYINSTSGTTGKLLRIRNLSDDKFYVKSDGGFYAKETS 300 

NAA + + K+ NGAGTAAQGIYINSTSGTTGKLLRIRNLSDDKFYVKSDGGFYAKETS 
Sbjct: 241 KAARYPLILSKRQNGAGTAAQG1YINSTSGTTGKLLRIRNLSDDKFYVKSDGGFYAKETS 300 

Query: 301 QIDGNLKLKDPTANDHAATKAYVDKAI SELKKLILKK 337 

QIDGNLKLKDPTANDHAATKAYVDKAISELKKLILKK 
Sbjct: 301 QIDGNLKLKDPTANDHAATKAYVDKAI SELKKLILKK 337 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2590 

A DNA sequence (GASx698) was identified in S.pyogenes <SEQ ID 7679> which encodes the amino acid 
sequence <SEQ ID 7680>. Analysis of this protein sequence reveals the following: 
Possible site: 17 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4208 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

RGD motif 54-56 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAA98102 GB:M19348 ORF [Streptococcus pyogenes phage H4489A]
Identities = 250/648 (38%), Positives = 351/648 (53%), Gaps = 75/648 (11%)

Query: 1 MSRDPTLILDESNLVIGKDGRVHYTFTTEDDNPKVRLASKCLGTAHFNQLMIERGDQATS 60
MSRDPT ++E +L DGR + TF + + VRL S CLG +L +E +
Sbjct: 1 MSRDPTYTIMHDLSFA-DGRFYVTFKADKSSETTOLNSSCLGNTIIKKLQVEDDNTMHD 59

Query: 61 YVAPWVEGTGNPTGLFKDLKEISLELTDTANSQLWSKIKLTNRGMLQEYYDGKIKTEIV 120
+V P V T GL + +KE+ L+L D S LW KIK N+ ML EY + ++ 4 I
Sbjct: 60 FVKPKVT--TQQAFGLAQQVKELDLQLKDP-KSDLWGKIKFNNKaMLVEYANKEMSSAIA 116

Query: 181 LQAEIKASAQGLSQKYDDELRKLSAXITTTSSGTTEAYESKLAGLRAEFTRSNQGTRTEL 240
++++ L+ +D + L K + S T ++ S+L G + L
Sbjct: 153 VESARTQLASMFDSRISGLDGKYSRLSQ-TIDSLSSRLD---DGVGNYSTL 199

Query: 241 ESQISGLRAVQQSTASQISQEIRDREGAVSRVQQSLESYQRRMQDAEENYSSLTHTVRGL 300 
VSR+ Q+ + Q ++ +A +HYSSL4- TV+GL



Query: 301 QSDVGSETGKIQSRLTQLAGQIEQRVTRD3VMSIISGAGDSIKLAIQKAGGINAKMSGNE 360 

Q+ V SR-t- QL+ I +VT+ V + 1+ + D I AI+ + KM+G+E 

Sbjct: 249 QTTVRDNQSWATSRINQLSDLISTKVTKGDVETTIAQSYDKIAFAIRDKLPAS-KMTGSE 307 

Query: 361 IISAINLNSYGVTIAGKHIALDGNTTVNGTFTTKIAEAIKIRADQIIAGTIDAARIRVIN 420

IISAINL+ GV I GK+I LDGN+ ++ K A + A +1 G ++A+RI 

Sbjct: 308 IISAINLDRSGVKITGKNITLDGNSyISm-VIKDAHIA^MDAGKINTGYUJASRIAAEA 366 

Query: 421 LNASSIVGLDANFIK--AKIGY AIT---DLLEGKVIKARNGAMLI 460 

+ I A P K A GY A+T + G V+ A NGA 

Sbjct: 367 ITGDKIKMDYAFPNKLTANEGYFRTLFAKNIFTTSVQAVTTSASKITGGVLSATNGASRW 426 

Query: 461 DIOTAKMDFNSDATINFNSKNNALVRKDGTHTAFVHFSNATPKGYTGSALYASIGITSSG 520 

DLN+A +DFN DATINFNSKNNALVRK GT+TAFVHFSNATPKGY GSALYASIGITSSG 
Sbjct: 427 DMSANIDENRDATINFNSKNNALVRKSGTNTAFVHFSNATPKGYRGSALYASIGITSSG 486 



Query: 581 MLDMMDLYAAWALGRCMGHLAWGWNTAHSNFTSAVNRELNNYITKI 628 

M+D+N +Y A++ALGRCW H N W+ + + SA+ E N +1 + 
Sbjct: 547 MVDIJSrKMYQAIIjALGRCWLHAM^AWSW-NFDTRSAIIAEYNAHINISIL 593 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2591 

A DNA sequence (GASx699) was identified in S.pyogenes <SEQ ID 7681> which encodes the amino acid 
sequence <SEQ ID 7682>. Analysis of this protein sequence reveals the following: 
Possible site: 36 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3323 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2592 

A DNA sequence (GASx701) was identified in S.pyogenes <SEQ ID 7683> which encodes the amino acid 
sequence <SEQ ID 7684>. Analysis of this protein sequence reveals the following: 

Possible site: 20 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1017 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2593 

A DNA sequence (GASx702) was identified in S.pyogenes <SEQ ID 7685> which encodes the amino acid 
sequence <SEQ ID 7686>. Analysis of this protein sequence reveals the following: 

Possible site: 27 


>>> Seems to have an uncleavable N-term signal seq 

INTEGRAL Likelihood = -3.03 Transmembrane 2 - 18 ( 1-23) 

Final Results 

bacterial membrane --- Certainty=0.2211 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2594 

A DNA sequence (GASx703) was identified in S.pyogenes <SEQ ID 7687> which encodes the amino acid 
sequence <SEQ ID 7688>. Analysis of this protein sequence reveals the following:

Possible site: 25 

>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood = -3.45 Transmembrane 36 - 52 ( 36 - 55)

Final Results

bacterial membrane --- Certainty=0.2381 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC39287 GB:AF115103 orf87 gp [Streptococcus thermophilus bacteriophage Sfi21]
Identities = 43/73 (58%), Positives = Sl/73 (82%)

Query: 1 MINLKLRLQNKVTLMAILGAIFLIAQQLGIlCLPSNIADIANTAVTLLvLLGVWDPTO 60 

MIN KLRLQNK TL+A+-1- A+FL+ QQ G+ +P+NI + NT V +LV+LG++TDPTTKG 
Sbjct: 8 MINFKLRLQNKATLVALISAVFLMLQQPGLHVPNNIQEGINTLVGILVILGIITDPTTKG 67 



Query: 61 LSDSEQALTYHEP 73 

++DSE+AL+Y +P 
Sbjct: 68 IADSERALSYIQP 80 
Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2595 

A DNA sequence (GASx707R) was identified in S.pyogenes <SEQ ID 7689> which encodes the amino acid 
sequence <SEQ ID 7690>. Analysis of this protein sequence reveals the following:

Possible site: 22 

>>> Seems to have an uncleavable N-term signal seq 

INTEGRAL Likelihood =-10.35 Transmembrane 9 - 25 ( 1-27) 

Final Results

bacterial membrane --- Certainty=0.5140 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2596 

A DNA sequence (GASx714R) was identified in S.pyogenes <SEQ ID 7691> which encodes the amino acid
sequence <SEQ ID 7692>. Analysis of this protein sequence reveals the following: 
Possible site: 26 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1401 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2597

A DNA sequence (GASx715) was identified in S.pyogenes <SEQ ID 7693> which encodes the amino acid 
sequence <SEQ ID 7694>. Analysis of this protein sequence reveals the following: 

Possible site: 20 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0417 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 
Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2598 

A DNA sequence (GASx726) was identified in S.pyogenes <SEQ ID 7695> which encodes the amino acid 
sequence <SEQ ID 7696>. Analysis of this protein sequence reveals the following:

Possible site: 33 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -1.17 Transmembrane 18 - 34 ( 18 - 35)

Final Results

bacterial membrane --- Certainty=0.1468 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2599

A DNA sequence (GASx728R) was identified in S.pyogenes <SEQ ID 7697> which encodes the amino acid 
sequence <SEQ ID 7698>. Analysis of this protein sequence reveals the following: 

Possible site: 29 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1795 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAF61314 GB:U96166 unknown [Streptococcus cristatus]
Identities = 149/194 (76%), Positives = 162/194 (82%)



Query: 1 LSAIIRQSTSKRISDKRGIYL^KLVSI^CSYFTVTKTSPMIEEVRYYAKELLRLSERR 60
L IIRQSTSKRIS+KR YL +KL+ LAKQS+ V KTSPM+EEVRYYA+ELLRLSERR
Sbjct: 56 LYEIIRQSTSKRISEKRIAYLTDK1IKLAKQSFCAVKKTSPMLEEVRYYAQELLRLSERR 115

Query: 61 QAIFDKMVASAQPLPEDKILRSIPSIVETTATSIIGELGAIRRFQSANQINAFIGIDFRH 120
Q + + MVA AQPLPE ILRSIP I ETTATS I IGELG I RFQS NQ NAFIGID RH
Sbjct: 116 QWLNDMVALAQPLPEYDILRSIPGIAETTATSIIGELGDIHRFQSTNQFNAFIGIDLRH 175

Query: 121 YESGNYLAQEHITKRGNPYAPKILFKCIHDIAFASHTNPCHIADFYEKRKRQSQTASTKP 180
YES N+LA+EHITKRGNPYA KILFKCIH+IA ASHTNPCHIADFYEKRKRQS ASTKP
Sbjct: 176 YESRNFLAKEHITKRGNPYARKILFKCIHNIASASHTNPCHIADFYEKRKRQSTIASTKP 235

Query: 181 HTIASRHCLVRQCF 194
TIAS H L+R +
Sbjct: 236 LTIASIHRLIRTMY 249
Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2600 

A DNA sequence (GASx729R) was identified in S.pyogenes <SEQ ID 7699> which encodes the amino acid 
sequence <SEQ ID 7700>. Analysis of this protein sequence reveals the following:

Possible site: 28 

>>> Seems to have no N-terminal signal sequence 

Final Results

bacterial cytoplasm --- Certainty=0.2363 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2601 

A DNA sequence (GASx730R) was identified in S.pyogenes <SEQ ID 7701> which encodes the amino acid
sequence <SEQ ID 7702>. Analysis of this protein sequence reveals the following: 
Possible site: 25 

>>> Seems to have an uncleavable N-term signal seq 

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2602

A DNA sequence (GASx734) was identified in S.pyogenes <SEQ ID 7703> which encodes the amino acid 
sequence <SEQ ID 7704>. Analysis of this protein sequence reveals the following: 

Possible site: 52 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4001 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2603 

A DNA sequence (GASx735) was identified in S.pyogenes <SEQ ID 7705> which encodes the amino acid 
sequence <SEQ ID 7706>. Analysis of this protein sequence reveals the following: 

Possible site: 55 



Final Results

bacterial membrane --- Certainty=0.2455 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2604 

A DNA sequence (GASx736) was identified in S.pyogenes <SEQ ID 7707> which encodes the amino acid 
sequence <SEQ ID 7708>. Analysis of this protein sequence reveals the following: 
Possible site: 33 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3998 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2605 

A DNA sequence (GASx737) was identified in S.pyogenes <SEQ ID 7709> which encodes the amino acid 
sequence <SEQ ID 7710>. Analysis of this protein sequence reveals the following: 

Possible site: 60 

>>> Seems to have a cleavable N-term signal seq. 

INTEGRAL Likelihood =-12.74 Transmembrane 77 - 93 ( 69 - 99) 
INTEGRAL Likelihood = -4.14 Transmembrane 152 - 168 ( 151 - 170) 
INTEGRAL Likelihood = -1.17 Transmembrane 196 - 212 ( 194 - 212) 



Final Results

bacterial membrane --- Certainty=0.6095 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



WO 02/34771 



-2720- 



PCT/GB01/04789 



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
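
For illustration, the INTEGRAL transmembrane predictions listed in Example 2605 can be read into a small table. The sketch below is a hypothetical Python helper (`Segment` and `parse_segments` are names introduced here, not taken from the analysis software). It assumes that a more negative likelihood value indicates a stronger transmembrane prediction.

```python
import re
from typing import NamedTuple


class Segment(NamedTuple):
    """One predicted transmembrane segment (hypothetical container)."""
    likelihood: float
    start: int
    end: int
    ext_start: int  # start of the range given in parentheses
    ext_end: int    # end of the range given in parentheses


SEGMENT_RE = re.compile(
    r"Likelihood\s*=\s*(-?\d+\.\d+)\s+Transmembrane\s+"
    r"(\d+)\s*-\s*(\d+)\s*\(\s*(\d+)\s*-\s*(\d+)\s*\)"
)


def parse_segments(text):
    """Return all complete segments found in text, ordered by start position."""
    segments = []
    for m in SEGMENT_RE.finditer(text):
        positions = [int(g) for g in m.groups()[1:]]
        segments.append(Segment(float(m.group(1)), *positions))
    return sorted(segments, key=lambda s: s.start)


sample = """
INTEGRAL Likelihood =-12.74 Transmembrane 77 - 93 ( 69 - 99)
INTEGRAL Likelihood = -4.14 Transmembrane 152 - 168 ( 151 - 170)
INTEGRAL Likelihood = -1.17 Transmembrane 196 - 212 ( 194 - 212)
"""
segments = parse_segments(sample)
# Assumption: the most negative likelihood marks the strongest prediction.
strongest = min(segments, key=lambda s: s.likelihood)
for seg in segments:
    print(seg)
print("strongest:", strongest.start, "-", strongest.end)
```

Lines whose likelihood or positions are truncated in a scanned copy do not match the pattern and are skipped rather than guessed.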

Example 2606 

A DNA sequence (GASx738) was identified in S.pyogenes <SEQ ID 7711> which encodes the amino acid
sequence <SEQ ID 7712>. Analysis of this protein sequence reveals the following: 

Possible site: 37 



>>> Seems to have a cleavable N-term signal seq.

[Six INTEGRAL transmembrane predictions are listed at this point. The scan preserves only the integer part of each likelihood (-13, -10, -7, -7, -2, -0), the segment positions 185 - 201, 132 - 148, 208 - 224 and 39 - ..., and the flanking ranges ... - 114, 179 - 207, 130 - 158, 204 - 225 and 152 - 169.]



Final Results

bacterial membrane --- Certainty=0.6265 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2607 

A DNA sequence (GASx742) was identified in S.pyogenes <SEQ ID 7713> which encodes the amino acid 
sequence <SEQ ID 7714>. Analysis of this protein sequence reveals the following: 
Possible site: 22 

>>> Seems to have an uncleavable N-term signal seq
Likelihood = -7.8C 
Likelihood = -4.8! 



Final Results 

bacterial membrane --- Certainty=0.4121 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

LPXTG motif: 877-881 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB46409 GB:AL096743 putative large secreted protein 
[Streptomyces coelicolor A3 (2) ] 
Identities = 231/599 (38%), Positives = 329/599 (54%), Gaps =43/599 (7%) 

Query: 278 TSSNSDASSRNIVKIGEIQ^SASHTSPLLKKAVTVEQvVVTXL DDSTHFYVQDLNGDG 334 

T +++ +++ V+I ++QG++ SP + VT +VT + S F++QD D 

Sbjct: 28 TPAHAASAAAGPVRIHDVQGSTRLSPyAGEQVTDVAGIVTGVRGYGSSKGFWMQDPLPDA 87 






Query: 335 DIATSDGIRVFAKNA- KVQVGDVLTI SGEVEE F FGRGYEERKQTDLT I TQI VAKAVTK- T 392 

D ATS+G+ VF A +V VGD +T+SG V E+ G Q+ +T+I VT + 
Sbjct: 88 DPATSEGVWFTSRAPEVAVGDAVTVSGTVSEYVPGGTSSGNQS LTEITRPTVTWS 144 

Query: 393 GTAQVPSPLVLGKDRIAPANI IDNDGLR VFDPEEDAIDYWESMEGMLVAVDDA 445 

G +P+ + + A + DG P A+DY+ES+EGM V V DA 

Sbjct: 145 GGNAIPAATTVSARSVPRAYAPEGDGAAKGSVNALPLRPGTYALDYYESLEGMNVRVADA 204 

Query: 446 KILGPMKN- KEIYVLPGSSTRPliNNSGGVLLPANSYMTDVI PVLFKKGKQI IKAGD 500 

+++G E++V P GV + NT4+ GK GD 

Sbjct: 205 RWGASDPYTELWVTVKPWENPNRRGGTVYGSYDDQNTGRLQIQ-SLGKPADFPAADVGD 263 

Query: 501 SYKGRLAGPVSYS-YGNYKVFVDDSKNMPSLMDGHLKPEKTNLQKDLSKLSIASYNIENF 559 

+ G AGP+ Y+YGY+ + ++LG+ETQ +L++A+YN+EN 
Sbjct: 264 TLAGTTAGPljDYNQYGGYTLVASE IGALESGGTERESTRRQS -AREDAVATYNVENL 319 

Query: 560 SANPSSTKDEKVKRIAESFIHDLNAPDIIGLIEVQDjmGPTDDGTTDATQSAQRLIDAIK 619 

+PS D+ AE+ +H h +PDI+ L E+QDNNG TDDGT A + RLIDAI 

Sbjct: 320 --DPS DDTFTAHAETIVHRLKSPDIVSLEE^QDNNGATDDGTVAADATVGRLIDAIV 374 

Query: 620 KLGGPTYRYVDIAPENNVDGGQPGGNIRTGFLYQPERVSLSDKPKGGARDA--LTWVMGE 677 

GGP Y + I P + DGGQPGGNIR FL+ PERVS +D+ G A A + V G+ 
Sbjct: 375 AAGGPRYDWRGIDPVDKADGGQPGGNIRQAFLFNPERVSFTDRAGGDATTATGVRKVRGK 434 

Query: 678 --LNLSVGRIDPITIAAWKDWKSLAAEFIFQGRKVVVVANHI^SKEGDNALYGCVQPVTF 735 

L S GR+DP N AW+D RK IA EF+F+GR V WANH NSK GD L QP + 
Sbjct: 43 5 AALTHSPGRVDPANEAWEDSRKPLAGEFVFRGRTVFWANHFNSKGGDQGLTAQYQPPSR 494 

Query: 736 KSEQRRHVLANMIAQFAKE--GAKHQANIVMIiGDFNDFEFTKTIQLIE-EGDMVNIiVSRH 792 

SE +RH A +4- F KE A+ A++V LGD NDFEF++T +++E +G + + V 
Sbjct: 495 GSETQRHAQAKWNTFVKE I LAAQKNAD WALGD IWDFEFSRTAR I LEGDGALWSAVKSL 554 

Query: 793 DISDRYSYFHQGNNQTLDNILVSRHLL- -DHYEFDMVHVNSPFMEAHGRASDHDPLLLQ 849 

S+RYSY +QGN+Q LD ILVS + H +D VHVN+ F H + SDHDP +L+ 
Sbjct: 555 PRSERYSYVYQGNSQVLDQILVSPSVRRGGHLSYDSVHVNAEF HDQ1 SDHDPQVLR 610 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
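
As an illustrative check, the summary line that heads each alignment (Identities, Positives, Gaps) can be recomputed from its own counts. The Python sketch below uses a hypothetical helper, `check_summary`, introduced only for this purpose. It assumes the reported percentages are truncated rather than rounded, which is consistent with the summary line of Example 2607 (329/599 is 54.9%, reported as 54%).

```python
import re

# Hypothetical helper for sanity-checking alignment summary lines.
SUMMARY_RE = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)\s*\(\s*(\d+)\s*%\s*\)"
)


def check_summary(line):
    """Return {label: (count, length, reported_pct, consistent)}.

    'consistent' is True when the reported percentage equals the
    truncated value of 100 * count / length (an assumption about how
    the percentages were produced).
    """
    report = {}
    for label, num, den, pct in SUMMARY_RE.findall(line):
        num, den, pct = int(num), int(den), int(pct)
        report[label] = (num, den, pct, (100 * num) // den == pct)
    return report


summary = "Identities = 231/599 (38%), Positives = 329/599 (54%), Gaps =43/599 (7%)"
report = check_summary(summary)
for label, values in report.items():
    print(label, values)
```

A summary line whose counts and percentage disagree under this assumption may point to a transcription error in either figure.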

Example 2608 

A DNA sequence (GASx743) was identified in S.pyogenes <SEQ ID 7715> which encodes the amino acid 
sequence <SEQ ID 7716>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2437 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2609

A DNA sequence (GASx756) was identified in S.pyogenes <SEQ ID 7717> which encodes the amino acid 
sequence <SEQ ID 7718>. Analysis of this protein sequence reveals the following:

Possible site: 18 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -4.30 Transmembrane 10 - 26 ( 8 - 27)
INTEGRAL Likelihood = -3.08 Transmembrane 51 - 67 ( 50 - 67)

Final Results

bacterial membrane --- Certainty=0.2720 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2610 

A repeated DNA sequence (GASx758) was identified in S.pyogenes <SEQ ID 7719> which encodes the 
amino acid sequence <SEQ ID 7720>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MTNGLKYVLEQMLIjLFIIAAIjACLPIAI3LKIGYSFMGDGQSPWHILSMDKWAELVNKFT 60 

M yv++++LL,+ 1+ L L L IGLM+GY +G GQ PW ILS KW EL++KFT 

Sbjct: 3 MNKKSSYWKRLLLVIIVLILGTLALGIGLMVGYGILGKGQDPWAILSPAKWQELIHKFT 62 

Query: 61 G 61 
G 

Sbjct: 63 G 63 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2611 

A DNA sequence (GASx764) was identified in S.pyogenes <SEQ ID 7721> which encodes the amino acid
sequence <SEQ ID 7722>. Analysis of this protein sequence reveals the following:

Possible site: 58 

>>> Seems to have no N-terminal signal sequence 

INTEGRAL Likelihood = -3.98 Transmembrane 47 - 63 ( 46 - 67)

Final Results 

bacterial membrane --- Certainty=0.2593 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

A related sequence was also identified in GAS <SEQ ID 9149> which encodes the amino acid sequence 
<SEQ ID 9150>. Analysis of this protein sequence reveals the following: 

Possible site: 53 
>>> Seems to have no N-terminal signal sequence 

INTEGRAL Likelihood = -3.98 Transmembrane 35 - 51 ( 34 - 55) 

Final Results 

bacterial membrane --- Certainty=0.2593 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2612 

A DNA sequence (GASx783) was identified in S.pyogenes <SEQ ID 7723> which encodes the amino acid 
sequence <SEQ ID 7724>. Analysis of this protein sequence reveals the following: 
Possible site: 43 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood =-13.16 Transmembrane 142 - 158 ( 132 - 167)
INTEGRAL Likelihood =-12.26 Transmembrane 113 - 129 ( 101 - 140)
INTEGRAL Likelihood =-10.24 Transmembrane 238 - 254 ( 233 - 260)
INTEGRAL Likelihood = -2.76 Transmembrane 34 - 50 ( 34 - 51)

Final Results

bacterial membrane --- Certainty=0.6265 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAA32091 GB:AB010970 ABC-transporter [Streptococcus mutans] 
Identities = 173/269 (64%) , Positives = 214/269 (79%) , Gaps = 2/269 (0%) 

Query: 1 MNFLTKKNRILLREMVKTDFKLRYQGSAIGYLWSILKPLMMFTIMYLVFIRFLRLGGNVP 60
M+F ++KNRILL+E++KTDFKLRYQGSAIGYLWSILKPLM+F IMY+VF+RFL LGG+VP
Sbjct: 1 MDFFSRKNRILLKELIKTDFKLRYQGSAIGYLWSILKPLMLFAIMYIVFVRFLPLGGDVP 60

Query: 61 HFPVALLLANVIWSFFSEATSMGWSIVSRGDLLRKLNFSKHI IVFSAVLGALINFLINL 120
H+PVALLL NVIW+FF E T MGMVS+V+RGDLLRKLNFSK IVFSAV GA INF IN+

+WLIFAL+NGVT + +L + LF+EL++ GIA +LS ++V YRD+ VWEV+LQ
Sbjct: 121 L IVA7LIFALLNGVTFTFRWNLFLLIPLFLEIiLLFSTGIAFILSTLYVRYRDIGPVWEVILQ 180

Query: 179 AGMYATPIIYPITFVLDSHPLAAKLMLNPVAQMIQDFRYLLIDRANVTIWQMSTNWFYI 238 
G Y TPIIY +T++ + AKLL+L+P+AQ+IQD R++LID ANVTIWQM +

Sbjct: 181 GGFYGTPIIYSLTYIATRSWGAICriTiTiT.SPIAQIIQDMRHILIDPANVTIWQMIMHKSIA 240 

Query: 239 VIPYLVPFVILFIGIFVFKKNADRFAEII 267 

VIPYLVP + IG VF NA +FAEII 
Sbjct: 241 VI PYLVP I FVF 1 1 GFLVFNYNAKKFAE 1 1 269 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 



Example 2613

A DNA sequence (GASx786) was identified in S.pyogenes <SEQ ID 7725> which encodes the amino acid 
sequence <SEQ ID 7726>. Analysis of this protein sequence reveals the following: 

Possible site: 32 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3828 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MNRILLYVHFNKYNKISAHVYYQLEQMRSIjFSKIvPISNSKVSHEDLKRLiaraCLIDEFL 60 

M R+LLYVHFNKYN++S+HV YQL QMRSLFSK++FISNS+V+ D+K L+ LID+F+ 
Sbjct: 1 MKRLLLYVHFNK^VSSHVVYQLTQMRSLFSKVIFISNSQVADADWMLREKHLIDDFI 60 

Query: 61 QRKNKGFDFSAWHDGLIIMGFDKLEEFDSLTIMNDTCFGPIWEMAPYFENFEEKETVDFW 120 

QR+N GFDF+AW DG++ 4-GFD+L +DS+T MNDTCFGP+WEM ++ FE K TVDFW 
Sbjct: 61 QRQNSGFDFARWRDGWFVGFDELV^YDSVTTMNDTCFGPLWEMYSIYQEFETKTTVDFW 120 

Query: 121 GITNNRGTKAFKEHVQSYFMTFKNQVIQNKVFQQFWQS1IEYENVQEVIQHYETQLTSIL 180 

G-I-TNNR TK+F-l-EHlQSYF-;- + ?K V++ ?4 FW-1- + I EY++VQ+VI YET++T4- L 
Sbjct: 121 GLTNNRATKSFREHIQSYFISFKASVLRSTAFRDFWENIKEYQDVQKVIDQYETKVTTTL 180 

Query: 181 LNEGFSYQWFDTRKAESSFMPHPDFSYYNPTAILKHHVPFIKVKAIDANQHIAPYLIiNL 240 

L+ GF Y VFDT K ++S M H DFSYYNPTAIL H VPFIKVKAID NQHI PYLLN 
Sbjct: 181 LDAGFQYDWFDTTKEDASHMLHADFSYYNPTAILl^RVPFIKVKAIDNNQHITPYIjIiND 240 



Query: 241 IRETTNYPIDLIVSHMSQISLPDTI<YLLSQICY'LNCQRLAKQTCQKVAVHLHVFYVDLLDE 300 
I ++ + YPIDLIVSHMS+I+ PD YLL KY+ + QKVAVHLHVFYVDLL+E 
Sbjct: 241 IQKNSTYPIDEIVSHNSEINYPDFSYLLGHKYvTOCRERTOLKNQKVAVHLHVFYVDLLEE 300

Query: 301 FLTAFENWNFHYDLFITTDSDIKRKEIKEILQRKGKTADIRVTGNRGRDIYPMLLLKDECL 360 

FLTAF+ ++F YDLFITTDSD K+ EI+EIL G+ A + VTGN GRD+ PML LK+ L 
Sbjct: 301 FLTAFKQFHFSYDLFITTDSDDKKAEIEEILS.ANGQEAQVFVTGNIGRDVLPMLKLKNYL 360 


Query: 361 SQYDYIGHFHTKKSKEADFWAGESWRKELIDMljVKPADSILSAFETD-DIGIIIADIPSF 419 

S YD++GHFHTKKSKEADFWAG+SWR+ELIDMLVKPAD+IL+ + + IG++IAD+P+F 
Sbjct: 361 SAYDFVGHFHTKKSKEADFWAGQSWREELIDMLVKPADNILAQLQQNPKIGLVIADMPTF 420 

Query: 420 FRFNKIWAWNEHLIAQEMMSLWRKMDVXKQIDFQ^roTFVMSYGTFVWFKYDALKSLFD 479

FR+NKIV+AWNEHLIA EM +LW+KM + K+IDF A TFVMSYGTFVWFKYDALK LFD 
Sbjct: 421 FRYNKIVDAt^NEHLIAPEMNTLWQCTIGMTKKIDFNAFHTFVMSYGTFVWFKYDALKPrjFD 480 



Query: 480 LELTQNDIPSEPLPQNSILHAIERLLVYIAWGDSYDFRIVKNPYELTPFIDNKLLNLRED 539 
L LT +D+P EPLPQNSILHAIERLL+YIAW + YDFRI KNP +LTPFIDNKLLN R +

Sbjct: 481 LNLTDDDVPEEPLPQNSILHAIERLLIYIAWNEHYDFRISKNPVnLTPFIDNKLLNERGN 540 

Query: 540 EGAHTYWFNQMGGIKGALKYIIVGPAKAMKYIFLRLI^EKLK 581

+T+V+FN MGGIKGA KYI +GPA+A+KYI R ++K+K 
Sbjct: 541 SAPNTFVDFNYMGGIKGAFKYIFIGPARAVKYILKRSLQKIK 582 


Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2614 

A DNA sequence (GASx787) was identified in S.pyogenes <SEQ ID 7727> which encodes the amino acid 
sequence <SEQ ID 7728>. Analysis of this protein sequence reveals the following:

Possible site: 33 



>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood = ... Transmembrane 202 - 218 ( 191 - 224)
INTEGRAL Likelihood = ... Transmembrane 340 - 356 ( 335 - 365)
INTEGRAL Likelihood = ... Transmembrane 270 - 286 ( 263 - 289)
INTEGRAL Likelihood = ... Transmembrane 124 - 140 ( 118 - 145)
INTEGRAL Likelihood = ... Transmembrane 377 - 393 ( 375 - 395)
INTEGRAL Likelihood = ... Transmembrane 291 - 307 ( 290 - 311)
INTEGRAL Likelihood = ... Transmembrane 160 - 176 ( 159 - 180)
INTEGRAL Likelihood = ... Transmembrane 50 - 66 ( 48 - ...)

Final Results

bacterial membrane --- Certainty=0.7262 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAA32095 GB:AB010970 ORF7 [Streptococcus mutans] 
Identities = 374/775 (48%) , Positives = 525/775 (67%) , Gaps = 7/775 (0%) 





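[Alignment block labels preserved in the scan: Query 53, 113, 168, 228, 288, 348, 408; Sbjct 30, 90, 150, 208, 268, 328, 388. The Query and Sbjct sequence lines for these blocks are not preserved; the surviving match-line fragments follow.]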



V V Y++S++GLS+YLS+ 4 



KN F+Q LP+L++I+ 



- LN++ F IW L 



+P+ S+A +SL A +FNY FQYG+K D LLG Y+ PGAT +QI+ +T 



NRY TF ++ILG+II+WN LK MR+EPLLVTDF W+ 



LA I++ +L 



+ ++ S+ I F EK KI++G+P+ 



IS +NN DI + GF +NA YKSL YVWT+QVTK IM+KP++YS+E I +A+KY +A 

Query: 468 DINKDRKNNIADQTVIYLLSESLSDPDRVSNV^/SHDVLENIKAIKNSTTAGLMQSDSYG 527

INK R NI++QTVIY+LSES SDPDRV V +S DV+ENIK IK TT+GLM SD YG 
Sbjct: 448 KINKVRTENISNQTVIYILSESFSDPDRVQGiVKLSRDVIPNIKQIKEKTTSGLMHSDGYG 507 

Query: 528 GGTAOTEFQTLTSLPFYNFSSSVSVIiYSBWPia^KPHTISEFYQGKNRIAMHPASAMNF 587 

GGTANMEFQ+LT LP+YNF+SSVS LY+EV P M+ +IS ++ KNR+ +HP+SA+N+ 
Sbjct: 508 GGTANMEFQSLTGLPYYNFNSSVSTLYTEWPDMSVFPSISNQFKSKNRWIHPSSASNY 567 

Query: 588 NRKTWSNLGFSKFLALSGSKDKFKNIENVGLLTSDKTVYNMILSLINPSESQFFSVITM 647 

+RK VY L F F+A SG+ DK + E VGL SDKT Y NIL INPS+SQFFSV+TM 
Sbjct: 568 SRKYVYDKLKFPTFVASSGTSDKITHSEK\rGLNVSDKTTYQMILDKINPSQSQFFSVMTM 627 

Query: 648 QNHIPWSSDYPEEIVAEGKNFTEEENHNLTSYARLLSFTDKETRAFLEKLTQINKPITW 707 

QNH+PW+SD P ++VA GK +T++EN +L+SYARLL++TDKET+ Fh +L+Q+ +TW 
Sbjct: 628 QNHVPWASDEPSDWATGKGYTKDENGSLSSYARLLTYTDKETKDFIAQLSQLKHKVTW 687 

Query: 708 FYGDHLPGLYPDSAFNKHIENKYLTDYFIWSNGTNEKKNHPLINSSDFTAALFEHTDSKV' 767 

FYGDHLPGLYP+SAF K +++Y TDYFIWSK + NH +NSSDFTA L EHT+SKV 
Sbjct: 688 FYGDHLPGLYPESAFKKDPDSQYQTDYFIWSNYOTKILKHSYVNSSDFTAELLEHTNSKV' 747 

Query: 768 SPYYAIiTEVLNKASVDKSPDSPEVKAIQNDLKNIQYDVTIGKGYLLKHKTFFKI 822

SPYYALLTEVL+ +V 4- E K I MDLK IQYD+T+GKGY+ +K FF I 

Sbjct: 748 SPYYALLTEVLDOT'Tl'GHGKLTKEQKEIANDLKLIQYDITVGKGYIRNYKGFFDI 802 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2615 

A DNA sequence (GASx789R) was identified in S.pyogenes <SEQ ID 7729> which encodes the amino acid 
sequence <SEQ ID 7730>. Analysis of this protein sequence reveals the following:
Possible site: 13 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = ... Transmembrane 42 - 58 ( 42 - 58)

Final Results

bacterial membrane --- Certainty=0.1426 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 



Example 2616

A DNA sequence (GASx790) was identified in S.pyogenes <SEQ ID 7731> which encodes the amino acid
sequence <SEQ ID 7732>. Analysis of this protein sequence reveals the following:

Possible site: 24 

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2617 

A DNA sequence (GASx791) was identified in S.pyogenes <SEQ ID 7733> which encodes the amino acid 
sequence <SEQ ID 7734>. Analysis of this protein sequence reveals the following: 

Possible site: 48 



>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood = ... Transmembrane ... - 182 ( 157 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 101 ( 79 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 413 ( 386 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 269 ( 252 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 317 ( 293 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 379 ( 362 - ...)
INTEGRAL Likelihood = -3.24 Transmembrane 335 - 351 ( 335 - ...)

Final Results

bacterial membrane --- Certainty=0.5967 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAA64645 GB:U10927 CapF [Staphylococcus aureus] 
Identities = 97/419 (23%) , Positives = 186/419 (44%) , Gaps = 40/419 (9%) 

Query: 12 FLVMLGSLSTAVISVILLMVVTRI^TSADSDIYJWAYSFANMMVWGLFQvENYQATDI 71 

F + + ++ +A+ ++L+V+ RL T D Y +A + + ++R+ T 

Sbjct: 5 FNYMFVANILSALCKFLILLVIVRLGTPEDVGRYNYALVITAPIFLFISLKIRSVIVT-- 62 

Query: 72 NEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGM 131 

N+KYS ++Y+ A L ++ L I++ + T + +V + + ++ G+ 

Sbjct: 63 NDKYSPNEYISAILSLNIITLIFVAIFVYVLGNGDL--TTILIVSLIKLFENIKEVPYGI 120 

Query: 132 FQQHERLDIAGKSIAYRNTLIFMVYTAIILYSKNLTIALVAVCIVSLWIMYYDIGHSKK 191 

+Q++E L + G S+ N L +++ I +S NL +AL+ + I + D + K 

Sbjct: 121 YQKKESLKLLGISMGIYNILSLILFYIIYSFSHNLNMALLFLVISCIFSFAIIDRWYLSK 180 

Query: 192 FQKLMFSELLSNISFQNSLKLLKESF PLFLNGFLI I YI YTQPKYAIELMTTLGEVA 247 

+ + + + N++ KE F PL + L P+ +E + G+ 

Sbjct: 181 YYNI KLHYNNNIAKFKEIFILTIPLAFSSALGSLNTGIPRIVLENL- -FGKYT 231 

Query: 248 LGS-QTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIK-EFNKIQVQLFAYLGVF- 304 

LG TI +L + N + F P + + L + + K EF K+ ++ ++G+F 
Sbjct: 232 LGIFSTIAYVLVIGGLFANSISQVFLPKLRK LYKDEKKIEFEKLTRKM-VFIGIFI 286 

Query: 305 SLIALVGSGLFGIPFLSILYG TNLTDYWVDF-MLIMLGGSIGSFATVIDNILTAM 358 

+ +++ S G LS+L+G N+ + F +L +L G + 

Sbjct: 287 GMCSVILSLFLGEALLSLLFGKEYGENNIILIILSFGLLFILSGIFLGTTIIATGKYNVN 346 



Query: 359 RKQQLLLIPYTGGFLISLLITNIiFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRF 417 
K L+L+ F I L+ + L + KY +LGAAL+ 1+ V L I Y F F 

Sbjct: 347 YKISLILL FCI-LIFSFLLIPKYSLLGAALTITISQFVAL- — ISYYYFYKRIF 396

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 



Example 2618 

A DNA sequence (GASx792) was identified in S.pyogenes <SEQ ID 7735> which encodes the amino acid 
sequence <SEQ ID 7736>. Analysis of this protein sequence reveals the following: 

Possible site: 36 



>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = ... Transmembrane ... - 80 ( 60 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 59 ( 37 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 248 ( 229 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 426 ( 402 - ...)
INTEGRAL Likelihood = ... Transmembrane 298 - 314 ( 296 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 494 ( 471 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 281 ( 256 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 396 ( 378 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 226 ( 209 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 203 ( 187 - ...)
INTEGRAL Likelihood = ... Transmembrane ... - 34 ( 18 - ...)

Final Results

bacterial membrane --- Certainty=0.5012 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAA19642 GB:AB002668 unnamed protein product [Actinobacillus
actinomycetemcomitans]
Identities = 116/459 (25%) , Positives = 207/459 (44%) , Gaps = 60/459 (13%) 

Query: 461 QQQKFKQIEDILSDKLAIHVSLLFIIiftMLM- -STIFRYY 497

+ +K +1 + + LFI + + + + RYY 
Sbjct: 406 KSEKTSNNKTISKYFIIVPFLFLFISSFITINTLVSRYY 444 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2619 

A DNA sequence (GASx797) was identified in S.pyogenes <SEQ ID 7737> which encodes the amino acid
sequence <SEQ ID 7738>. Analysis of this protein sequence reveals the following:

Possible site: 49

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.1491 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAC83961 GB:L47648 cytidine monophosphate kinase [Bacillus subtilis]
Identities = 116/220 (52%) , Positives = 156/220 (70%) , Gaps = 1/220 (0%)

VIKIAIDGPASSGKSTVAKIIAKNLGYTYLDTGAMffiSATyiALTHGYTGKEVALILEE 61 
+ IAIDGPA++GKSTVAKI+A+ Y Y+DTGAMYR+ TY AL + + E 

+REE+V +Q+++ 4

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.



Example 2620 

A DNA sequence (GASx799) was identified in S.pyogenes <SEQ ID 7739> which encodes the amino acid
sequence <SEQ ID 7740>. Analysis of this protein sequence reveals the following:

Possible site: 29

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4324 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database:

>GP:CAA34313 GB:X16188 ribosomal protein L35 (AA 1-66) [Bacillus 
stearothermophilus] 
Identities = 46/65 (70%) , Positives = 51/65 (77%) 


Query: 1 MPKQKTHRASAKRFKRTGSGGLKRFRAFTSHRFHGKTKKQRRHLRKAGLVSSGDFKRIKA 60 

MPK KTHR SAKRFK+T SG IiKR A+TSH F KTKKQ+RHLRKA LVS GDFKRI+ 
Sbjct: 1 MPKMKTHRGSAKRFKKTASGKLKRGHAYTSHLFANICrKKQKRHLRKATLVSPGDFKRIRQ 60 

Query: 61 MVTGL 65

M+ L 
Sbjct: 61 MLDNL 65 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2621 

A DNA sequence (GASx806R) was identified in S.pyogenes <SEQ ID 7741> which encodes the amino acid 
sequence <SEQ ID 7742>. Analysis of this protein sequence reveals the following: 

Possible site: 16 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.5361 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2622 

A DNA sequence (GASx809R) was identified in S.pyogenes <SEQ ID 7743> which encodes the amino acid 
sequence <SEQ ID 7744>. Analysis of this protein sequence reveals the following: 

Possible site: 52

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -8.81 Transmembrane 33 - 49 ( 28 - 53)

Final Results

bacterial membrane --- Certainty=0.4524 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2623

A DNA sequence (GASx814R) was identified in S.pyogenes <SEQ ID 7745> which encodes the amino acid
sequence <SEQ ID 7746>. Analysis of this protein sequence reveals the following: 

Possible site: 33 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0206 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2624 

A DNA sequence (GASx817) was identified in S.pyogenes <SEQ ID 7747> which encodes the amino acid
sequence <SEQ ID 7748>. Analysis of this protein sequence reveals the following: 

Possible site: 13 

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = ... Transmembrane 16 - 32 ( 15 - 32)

Final Results

bacterial membrane --- Certainty=0.1595 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2625 

A DNA sequence (GASx820) was identified in S.pyogenes <SEQ ID 7749> which encodes the amino acid 
sequence <SEQ ID 7750>. Analysis of this protein sequence reveals the following: 
Possible site: 31 

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -7.11 Transmembrane 62 - 78 ( 59 - 81)
INTEGRAL Likelihood = -6.00 Transmembrane 128 - 144 ( 123 - 147)
INTEGRAL Likelihood = -2.50 Transmembrane 5 - 21 ( 3 - 26)

Final Results

bacterial membrane --- Certainty=0.3845 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAA26653 GB:M83994 prolipoprotein signal peptidase 
[Staphylococcus aureus] 
Identities = 57/153 (37%) , Positives = 96/153 (62%) , Gaps = 6/153 (3%) 


Query: 1 MKKRLFVLSLILL VALDQLSKFWIVSHIALGEVECPFIPGIVSLTYLQNNGAAFSIL 56 

M K+ F+ + IL+ V DQ++K+ I + + +G+ IP ++H-T +NNGAA+ IL 
Sbjct: 1 MHKKyFIGTSILIAVFWIFDQVTKYIIATTMKIGDSFETVIPHFliNITSHRISINGAAWGIIi 60 

Query: 57 QDQQWFFWITVLVIGYAIYYIATHPHtNIWKQLALLLIISGGIGNFIDRLRIAYVIDMI 116

+ FF +IT++++ +Y+ N++ Q+A-t- L+ +G +GNFIDR+ V+D I 

Sbjct: 61 SGKMTFFFIITIIILIALVYFFIKmQYNLFMQVAISLLFAGALGNFIDRILTGEWDFI 120 

Query: 117 HLDF--VDFAIFNVADSYLTVGVILLLICLWKE 147 
+ DF IFN+ADS LT+GVIL++I L K+

Sbjct: 121 DTNIFGYDFPIFNIACSSLTIGVILIIIALLKD 153 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2626

A DNA sequence (GASx822R) was identified in S.pyogenes <SEQ ID 7751> which encodes the amino acid
sequence <SEQ ID 7752>. Analysis of this protein sequence reveals the following: 

Possible site: 33 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2638 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2627 

A DNA sequence (GASx823R) was identified in S.pyogenes <SEQ ID 7753> which encodes the amino acid
sequence <SEQ ID 7754>. Analysis of this protein sequence reveals the following: 

Possible site: 45 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3452 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2628

A DNA sequence (GASx828) was identified in S.pyogenes <SEQ ID 7755> which encodes the amino acid 
sequence <SEQ ID 7756>. Analysis of this protein sequence reveals the following: 

Possible site: 21

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2629 

A DNA sequence (GASx836) was identified in S.pyogenes <SEQ ID 7757> which encodes the amino acid
sequence <SEQ ID 7758>. Analysis of this protein sequence reveals the following:

Possible site: 18

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4333 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2630 

A DNA sequence (GASx853R) was identified in S.pyogenes <SEQ ID 7759> which encodes the amino acid 
sequence <SEQ ID 7760>. Analysis of this protein sequence reveals the following:

Possible site: 14 

>>> Seems to have no N-terminal signal sequence 

Final Results

bacterial cytoplasm --- Certainty=0.4906 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2631 

A DNA sequence (GASx854R) was identified in S.pyogenes <SEQ ID 7761> which encodes the amino acid
sequence <SEQ ID 7762>. Analysis of this protein sequence reveals the following: 

Possible site: 43

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3989 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

A related sequence was also identified in GAS <SEQ ID 9147> which encodes the amino acid sequence 
<SEQ ID 9148>. Analysis of this protein sequence reveals the following: 

Possible site: 42

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.399 (Affirmative) < succ>

bacterial membrane --- Certainty=0.000 (Not Clear) < succ>

bacterial outside --- Certainty=0.000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 LMESKEIYLTKSPYIRGSLEIHSKNRKHEKIinjTOAKPNSTRSDVFKKyKDNKTINMKDF 60 

LM++ +IY SPY+ G +EI +K+ KHE+I+L+D+ TRSD+F KYKDN+ INMK+F 
Sbjct: 167 LMDNYKIYDATSPYVSGRIEIGTKDGKHEQIDLFDSPNEGTRSDIFAKYKDNRIINMKNF 226 

Query: 61 SHFDIYL 67 

SHFDIYL 
Sbjct: 227 SHFDIYL 233 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2632 

A DNA sequence (GASx855R) was identified in S.pyogenes <SEQ ID 7763> which encodes the amino acid 
sequence <SEQ ID 7764>. Analysis of this protein sequence reveals the following: 

Possible site: 33

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2633 

A DNA sequence (GASx856) was identified in S.pyogenes <SEQ ID 7765> which encodes the amino acid 
sequence <SEQ ID 7766>. Analysis of this protein sequence reveals the following: 

Possible site: 26

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4145 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2634 

A DNA sequence (GASx862) was identified in S.pyogenes <SEQ ID 7767> which encodes the amino acid 
sequence <SEQ ID 7768>. Analysis of this protein sequence reveals the following: 
Possible site: 19

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.6285 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2635 

A DNA sequence (GASx863) was identified in S.pyogenes <SEQ ID 7769> which encodes the amino acid 
sequence <SEQ ID 7770>. Analysis of this protein sequence reveals the following: 

Possible site: 51 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2636

A DNA sequence (GASx878) was identified in S. pyogenes <SEQ ID 7771> which encodes the amino acid
sequence <SEQ ID 7772>. Analysis of this protein sequence reveals the following: 

Possible site: 21

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2637 

A DNA sequence (GASx887R) was identified in S.pyogenes <SEQ ID 7773> which encodes the amino acid 
sequence <SEQ ID 7774>. Analysis of this protein sequence reveals the following: 

Possible site: 20

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1911 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S. agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2638 

A DNA sequence (GASx910) was identified in S.pyogenes <SEQ ID 7775> which encodes the amino acid 
sequence <SEQ ID 7776>. Analysis of this protein sequence reveals the following: 

Possible site: 20

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4511 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2639 

A DNA sequence (GASx911) was identified in S. pyogenes <SEQ ID 7777> which encodes the amino acid
sequence <SEQ ID 7778>. Analysis of this protein sequence reveals the following: 

Possible site: 52

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2993 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAC74707 GB:AE000259 glutathionine S-transferase [Escherichia coli]

Identities = 29/137 (21%), Positives = 61/137 (44%), Gaps = 9/137 (6%)

Query: 1 LPFIAKQTLKSQLIPQDNLLAESRFNEIMDFLTGDFPLVFRPMINPHRYTISQDNQALEK 60 

+ ++A QL+ N ++ + E ++++ + F P+ P E+ 
Sbjct: 70 MQYLADSVPDRQLLAPVNSISRYKTIEWLNYIATELHKGFTPLFRP---------DTPEE 120

Query: 61 VKQASYKR^IAMTHLDSLIGESGHVYRDQQTIADAYAYAMALWSQKTPKSYENYPHLAA 120 

K +++++++++ + + TIADAY + + W+ + E H+AA 

Sbjct: 121 YKPTVRAQLEKKLQYVNEALKDEHWICGQRFTIADAYLFTVLRWAYAVKLNLEGLEHIAA 180 

Query: 121 FMAKMVEDSAVQQVLNA 137 

FM +M E VQ L+A 
Sbjct: 181 FMQRMAERPEVQDALSA 197 
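
The percentages in the summary line of the alignment above follow from the reported counts. As a hedged illustration only (the helper `parse_alignment_summary` is hypothetical, and the use of integer truncation rather than rounding is an assumption that happens to reproduce the values printed for this hit), the fields can be extracted and recomputed as follows:

```python
import re

# Hypothetical helper: extract "Name = count/length (pct%)" fields from a
# BLAST-style summary line and recompute each percentage from its counts.
FIELD_RE = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)\s*\(\s*(\d+)\s*%\s*\)"
)

def parse_alignment_summary(line):
    """Map each field name to (count, length, reported_pct, recomputed_pct)."""
    fields = {}
    for name, count, length, reported in FIELD_RE.findall(line):
        count, length = int(count), int(length)
        # Assumption: the printed percentage is truncated, not rounded.
        recomputed = count * 100 // length
        fields[name] = (count, length, int(reported), recomputed)
    return fields

summary = "Identities = 29/137 (21%), Positives = 61/137 (44%), Gaps = 9/137 (6%)"
result = parse_alignment_summary(summary)
for name, values in result.items():
    print(name, values)
```

Comparing the reported and recomputed values in this way is one possible check for digits that were damaged during text extraction.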

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2640 

A DNA sequence (GASx932R) was identified in S.pyogenes <SEQ ID 7779> which encodes the amino acid 
sequence <SEQ ID 7780>. Analysis of this protein sequence reveals the following: 

Possible site: 14

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4081 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2641

A DNA sequence (GASx935) was identified in S.pyogenes <SEQ ID 7781> which encodes the amino acid
sequence <SEQ ID 7782>. Analysis of this protein sequence reveals the following: 

Possible site: 45 

>>> Seems to have no N-terminal signal sequence 

Final Results

bacterial cytoplasm --- Certainty=0.6304 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2642

A DNA sequence (GASx937) was identified in S.pyogenes <SEQ ID 7783> which encodes the amino acid
sequence <SEQ ID 7784>. Analysis of this protein sequence reveals the following:

Possible site: 34

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3503 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2643 

A DNA sequence (GASx938R) was identified in S.pyogenes <SEQ ID 7785> which encodes the amino acid 
sequence <SEQ ID 7786>. Analysis of this protein sequence reveals the following: 

Possible site: 27

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2884 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2644

A DNA sequence (GASx939) was identified in S.pyogenes <SEQ ID 7787> which encodes the amino acid 
sequence <SEQ ID 7788>. Analysis of this protein sequence reveals the following: 

Possible site: 50

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2771 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2645 

A DNA sequence (GASx941) was identified in S.pyogenes <SEQ ID 7789> which encodes the amino acid 
sequence <SEQ ID 7790>. Analysis of this protein sequence reveals the following: 

Possible site: 29

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2257 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2646 

A DNA sequence (GASx942R) was identified in S.pyogenes <SEQ ID 7791> which encodes the amino acid 
sequence <SEQ ID 7792>. Analysis of this protein sequence reveals the following: 

Possible site: 23

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3255 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 38 dqnsgenkhkrvhnlvsdilnrtqntdniklv:eyvcnplryinevsifeqlrtainipl 97
d + k r++n + n + +1 i p r+ + fe +r +n l
Sbjct: 39 dtdpqmtkrhrlynafasdqnsrkqrthi iafirkamkperfardserfepmrlninral 98

[The remaining alignment blocks, starting at Query 98/Sbjct 99, Query 158/Sbjct 159 and Query 218/Sbjct 217, are illegible in the source; only the following match-line fragments survive.]

? VL+FC +ELL +KYFH

AV EA K V +IR TG D A L+D+ F P++ I N+LQ+ E+ E +G NL



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2647 

A DNA sequence (GASx943R) was identified in S.pyogenes <SEQ ID 7793> which encodes the amino acid 
sequence <SEQ ID 7794>. Analysis of this protein sequence reveals the following: 
Possible site: 30

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1526 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2648 

A DNA sequence (GASx944) was identified in S.pyogenes <SEQ ID 7795> which encodes the amino acid 
sequence <SEQ ID 7796>. Analysis of this protein sequence reveals the following: 

Possible site: 19 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1427 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2649

A DNA sequence (GASx945) was identified in S.pyogenes <SEQ ID 7797> which encodes the amino acid 
sequence <SEQ ID 7798>. Analysis of this protein sequence reveals the following: 

Possible site: 13

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2578 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC98430 GB:L29324 excisionase [Streptococcus pneumoniae] 
Identities = 23/54 (42%), Positives = 40/54 (73%)

Query: 1 LIQQWEGLTVATAKQWATEMRDHPDFKQFVtNPTHRlVFIDYEGFKLFVQWKSR 54 

++++W+GL T +W EMR++ F +V+NPTH++VFI+ EGF+ F++WK + 
Sbjct: 21 ILKRWDGmKYTLNRWIKEMRFJ^TFShCTINPTHKLWINLEGFESFLRWKQK 74 


Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2650 

A DNA sequence (GASx946) was identified in S.pyogenes <SEQ ID 7799> which encodes the amino acid 
sequence <SEQ ID 7800>. Analysis of this protein sequence reveals the following:

Possible site: 16

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -4.99 Transmembrane 3 - 19 ( 1 - 23)

Final Results

bacterial membrane --- Certainty=0.2996 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2651

A DNA sequence (GASx950) was identified in S.pyogenes <SEQ ID 7801> which encodes the amino acid
sequence <SEQ ID 7802>. Analysis of this protein sequence reveals the following:

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2211 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2652 

A DNA sequence (GASx951) was identified in S.pyogenes <SEQ ID 7803> which encodes the amino acid
sequence <SEQ ID 7804>. Analysis of this protein sequence reveals the following: 

Possible site: 30

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4258 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2653

A DNA sequence (GASx952) was identified in S.pyogenes <SEQ ID 7805> which encodes the amino acid 
sequence <SEQ ID 7806>. Analysis of this protein sequence reveals the following: 

Possible site: 46

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2476 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAF74110 GB:AF212847 ORF245 [Lactococcus lactis bacteriophage U136.2]

Identities = 82/265 (30%), Positives = 128/265 (47%), Gaps = 27/265 (10%)



Query: 1 MANQLSTQQVKRDITTDPTLLTGADIKKYFDPQNLLSEKQVGQAIALCKGRNLNPFANEV 60
MAN+L V L IK+Y D S+ ++ + LCK N+NPF EV

Sbjct: 1 MANELGIFSVDN LNMTTIKQYIJXSGGKASnflELVLLINLCKQNNMNPFMKEV 52

[The remaining alignment blocks, starting at Query 61/Sbjct 53, Query 120/Sbjct 110, Query 180/Sbjct 169 and Query 235/Sbjct 220, are illegible in the source; only the following match-line fragment survives.]

T ++EYV++ K G+P W + P TM+ K A Q LR A



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2654 

A DNA sequence (GASx953) was identified in S.pyogenes <SEQ ID 7807> which encodes the amino acid 
sequence <SEQ ID 7808>. Analysis of this protein sequence reveals the following: 
Possible site: 13

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3413 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAF74111 GB:AF212847 ORF364 [Lactococcus lactis bacteriophage U136.2]

Identities = 67/222 (30%), Positives = 120/222 (53%), Gaps = 3/222 (1%)

[The alignment blocks, starting at Query 1/Sbjct 19, Query 61/Sbjct 79, Query 121/Sbjct 139 and Query 181/Sbjct 197, are illegible in the source; only the following match-line fragments survive.]

R ++KK ++ P + +G+ K+A P++ 1+ I

^ S+ ++ E QK E + AK +

T+ E++ I D L EKQ+



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2655

A DNA sequence (GASx954) was identified in S.pyogenes <SEQ ID 7809> which encodes the amino acid 
sequence <SEQ ID 7810>. Analysis of this protein sequence reveals the following: 

Possible site: 56

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3884 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2656 

A DNA sequence (GASx955) was identified in S.pyogenes <SEQ ID 7811> which encodes the amino acid
sequence <SEQ ID 7812>. Analysis of this protein sequence reveals the following: 
Possible site: 34

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1777 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2657 

A DNA sequence (GASx956) was identified in S.pyogenes <SEQ ID 7813> which encodes the amino acid 
sequence <SEQ ID 7814>. Analysis of this protein sequence reveals the following: 

Possible site: 16

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = ... Transmembrane ... ( 81 - 98)

Final Results

bacterial membrane --- Certainty=0.1977 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2658 

A DNA sequence (GASx958) was identified in S.pyogenes <SEQ ID 7815> which encodes the amino acid 
sequence <SEQ ID 7816>. Analysis of this protein sequence reveals the following:

Possible site: 34

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3673 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2659 

A DNA sequence (GASx960) was identified in S.pyogenes <SEQ ID 7817> which encodes the amino acid
sequence <SEQ ID 7818>. Analysis of this protein sequence reveals the following: 
Possible site: 36

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1852 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2660

A DNA sequence (GASx961) was identified in S.pyogenes <SEQ ID 7819> which encodes the amino acid 
sequence <SEQ ID 7820>. Analysis of this protein sequence reveals the following: 

Possible site: 45

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.7380 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAF63071 GB:AF158600 gp137 [Streptococcus thermophilus bacteriophage Sfi11]
Identities = 67/136 (49%), Positives = 97/136 (71%), Gaps = 2/136 (1%)

Query: 5 PEIDIQICTKSNAKRKLREYPRWRRIANDVDTQKVTATYSFEPRQPHGTPSKPVERLALNR 64

PEID + T KRKLREYPRWR IA+D QK+T ++F PR G +KPVE +A+ R
Sbjct: 4 PEIDEKATLKRCKRKLREYPRWREIAHDSAEQKITQEFTFMPRG--GGVNKPVENIAVRR 61

Query: 65 VSAEQELDTIERAVNGIFDPEYRLILIDKYLLTYPKTDCDIYTKLGYEKSQYYNMLDNAL 124

V A EL+ IE+AVNG++ P+YR ILI+KYL PK + I +G+E++ + +L+N++
Sbjct: 62 VDALNELEAIEQAVNGLYRPDYRRILIEKYLAYPPKPNWQIAQSIGFERTAFQELLNNSI 121

Query: 125 LSFSELYKEGMLLVEK 140 

L+F+ELY++G L+VE+ 
Sbjct: 122 LAFAELYRDGRLIVER 137 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2661 

A DNA sequence (GASx962) was identified in S. pyogenes <SEQ ID 7821> which encodes the amino acid
sequence <SEQ ID 7822>. Analysis of this protein sequence reveals the following: 

Possible site: 16

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3375 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2662 

A DNA sequence (GASx963R) was identified in S.pyogenes <SEQ ID 7823> which encodes the amino acid 
sequence <SEQ ID 7824>. Analysis of this protein sequence reveals the following: 
Possible site: 48

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2663

A DNA sequence (GASx964) was identified in S.pyogenes <SEQ ID 7825> which encodes the amino acid 
sequence <SEQ ID 7826>. Analysis of this protein sequence reveals the following: 

Possible site: 51

>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood = -S.16 Transmembrane 90 - 106 ( 89 - 111)
INTEGRAL Likelihood = -5.52 Transmembrane 131 - 147 ( 129 - 150)
INTEGRAL Likelihood = -0.43 Transmembrane 53 - 69 ( 52 - 69)

Final Results

bacterial membrane --- Certainty=0.3463 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S. agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2664 

A DNA sequence (GASx965) was identified in S.pyogenes <SEQ ID 7827> which encodes the amino acid 
sequence <SEQ ID 7828>. Analysis of this protein sequence reveals the following: 

Possible site: 15 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3944 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAAS6779 GB:X98106 Rorf172 [Bacteriophage phig1e]
Identities = 36/82 (43%) , Positives = 52/82 (62%) , Gaps = 3/82 (3%) 

Query: 18 ELTEKQQRFVDKyiTTFNATESAKQAGYSEKSAySQGQRLLKNVEIQKAMKERFLEAKDT 77 

+LT KQQ+F D+YI + NA ++A++AGYS++SA S GQ L +I++ + ER + 
Sbjct: 4 KLTPKQQKFADEYI KSGNAADAARKAGYSKRSARSVGQENLTKPDI KQYI DERM DEI 60 

Query: 78 KGDRIQDVAETLEQDTS IARGE 99 

RI D E +E T IARGE 
Sbjct: 61 ASKRIMDATEAVELLTRIARGE 82 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2665 

A DNA sequence (GASx966) was identified in S.pyogenes <SEQ ID 7829> which encodes the amino acid 
sequence <SEQ ID 7830>. Analysis of this protein sequence reveals the following: 
Possible site: 36 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2389 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB13115 GB:Z99110 PBSX defective prophage terminase (large 
subunit) [Bacillus subtilis] 
Identities = 117/417 (28%) , Positives = 195/417 (46%) , Gaps = 33/417 (7%) 

Query: 31 YRWKGSRGSKKSKTTAmFIWLLKyPVffiNLLVIRRYSmMKQSTYTDFKJJACNQLKVT 90 

Y+ + G GS KS TAL +++LLK LVIR +T+4 ST+ F+ +L 4-T 

Sbjct: 21 YQFLVGGyGSSKSYHTALKIVLKLLKEK-RTALVIREVFDTHRDSTFALFQEVIEELGLT 79 

Query: 91 HLFKFNESLPEITVKATGQKILFRGLDDELKITSITVDVGALCWAWFEEAYQIETEDKFS 150 

S ++ G +I+F+G+D+ K+ S V + W EE +4+ E 
Sbjct: 80 KAVASLSSPLQLRFH-NGSRIMFKGMDNPAKLKS VHNISLIWIEECSEVKYEG 131 

Query: 151 TWESIRGSLDAPDFFKQITVTFNPWSERHWLKRVFFDEETKR 193 

+ + G L P+ + T NP +W R FF +E K+ 
Sbjct: 132 --FKELIGRLRHPELKLHMICTTNPVGTSNWTYRHFFRDERKKRFVLDDSELYEKRTIVK 189

Query: 194 ADTFSGTTTFRVNEWLDDVDKRRYEDLYKT15IPRRARIVCDGEWGVAEGLVFDNFEWDFD 253 

DT+ +T N +L + ++ + L + +P RI G +GV V FEV+ D 
Sbjct: 190 GDTYYIfflSTANDNI^PESYVKQLDGLKETO^ 249 

Query: 254 - VEKT I QRVKET - - SAGMDFGFTQDPTTLI CVAVDLANKELWLYNEHYQKAMLTDHI VKM 310 

V+K I + + GMDFGF + ++ +AVD K L++Y E+YQ M D + 

Sbjct: 250 QVKKCIAAISKPIFRTGMDFGFEESY^VVTUjATOPEKKYLYIYWEYYQNKMTDDRTAEE 309 

Query: 311 IRDKNLHRSYIAGDSAEKRLIAEIKBKGVSGIVPSIKGKGSIMQGIQFMQGF-KIYIHPS 369 

+R+ + I DSAE + 1 + +G +V + K GS +Q + ++ F KI+ 
Sbjct: 310 LREFIETQELIKADSAEPKSIQYFRQQGFR-MVGARKFPGSRLQYTKKVKRFKKIFCSDR 368 

Query: 370 CEHTIEEFNTYTFKQDKEGI^L^PIDK1»IHVIDAIRYALEKYHIRSNESNQFEVLR 426 

CE+ IE T T+ +DK G + +- + H + AI YAL+ Y + + + +R 
Sbjct: 369 CENVIYELETLTYAKDKNGALIEDEFTIDPHTLSAIWYALDDYEVADMKETAHKRMR 425 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2666 

A DNA sequence (GASx967) was identified in S.pyogenes <SEQ ID 7831> which encodes the amino acid
sequence <SEQ ID 7832>. Analysis of this protein sequence reveals the following: 

Possible site: 32 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4899 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC34397 GB:AF158600 gp502 [Streptococcus thermophilus
bacteriophage Sfi11]
Identities = 67/114 (58%), Positives = 83/114 (72%) 

Query: 6 FRDSTGKTKTLEFRFHREREMRYQAESLESLLTEKYKLLREMIEHHDKVQKPRIQELLDY 65 
F DSTG+ L RFHRE+R+RY+A++LE L+ ++LD+ I HH Q PRIQELLDY

Sbjct: 7 FTDSTGQDLVLNLRFHRESRIRYRADNLEELMVMNWELLKNFINHHKLRQAPRIQELLDY 66 

Query: 66 AEGNNHTISEIGRRKDDDMADVRAVHNYGKYISTLKQGYLVGNPIRVEYIDGTE 119 
A G NH + + GRRKD++MAD RAVHNYG+ IS K GYL GNPIRVEY D + 
Sbjct: 67 ARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNED 120

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2667 

A DNA sequence (GASx968) was identified in S.pyogenes <SEQ ID 7833> which encodes the amino acid
sequence <SEQ ID 7834>. Analysis of this protein sequence reveals the following: 

Possible site: 34 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4007 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC34397 GB:AF158600 gp502 [Streptococcus thermophilus
bacteriophage Sfi11]
Identities = 172/319 (53%), Positives = 227/319 (70%), Gaps = 9/319 (2%)



Query: 1 LIYRSMDDKTEVVRLDPREVFVIYQNNLEQSSLAGWYYNKNQLDGTTKIVELYTDNKIL 60 

+IYRS D+T + HIPE FVIY N+LE +S+A VRYYN+ L +VE+YT+ I 

Sbjct: 157 VIYRSEYDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDWEIYTNQHIY 216 

Query: 61 KFEYDGDLTPIGETSSHAFGSVPITEYLNTDDGMGDYETELSLIDLYDAAQSDTANYMQD 120 

+ IT HAFG+VPITE+LN DG+GDYETEL LIDLYD+A+SDTAN+M D 

Sbjct: 217 TLDASDSFNEISVTP-HAFGTVPITEFLNNADGIGDYETELYLIDLYDSAESDTANHMSD 275 

Query: 121 LSDAIIiAIIGRVSFPGYVDTAEKAIEYLRKMRKARLLNLEPPVDQDGREGSVDAKYLYKQ 180 

++DAILAI G ++ P +■ ++ M++ RL+ L+PP DG+EG+V A+YL K 

Sbjct: 276 MADAILAIYGDLALPQGMQASD MKRTRLMQLKPPKSADGKEGTVKAEYLTKS 327 

Query: 181 YDVQGTFAYKNRIVSDIHKFTOTPDMTDSKFAGQQSGEALKWKVFGLDQERVDMQALFEQ 240 

YDV G EAYK R+ DIH FTNTPDM+D+ F+G SGEALK+K+FGLDQ+RVD Q+ F Q 
Sbjct: 328 YDVSGAEAYKTRMKDIHVFTCTPDMSDNHFSC-NASGEALKYKLFGLDQDRVDTQSQFTQ 387 

Query: 241 SLKRRYKLIARVSQLLKEIDDFDISKLKITFTPNLPKSLQEKIEAFKALGGELSQETAMA 300 

LKRRY+L AR+ L+ E DFD S+LKITFTPNLPKSL E++ LGG++SQETA++ 
Sbjct: 388 GLKRRYRLAARIGSIiVNEFKDFDESRLKITFTPNLPKSLYEQVSILWDLGGQVSQETALS 447 

Query: 301 ITDIVEDAKKEISLINSES 319 

++ +VE+ +E+ IN ES 
Sbjct: 448 LSGLVENPTEELDKINEES 466 



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2668

A DNA sequence (GASx969) was identified in S.pyogenes <SEQ ID 7835> which encodes the amino acid 
sequence <SEQ ID 7836>. Analysis of this protein sequence reveals the following:

Possible site: 21 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.5307 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S. agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC79543 GB:U88974 ORF28 [Streptococcus thermophilus temperate 
bacteriophage O1205]
Identities = 118/309 (38%) , Positives = 183/309 (59%) , Gaps = 18/309 (5%) 

Query: 8 YWRDRIKKEMDAK-EADDISLEQSMKQLHDYHFRNIEKEIESFYQRYADKEKIDLSEARK 66
YW R +E +A + + +4- ++ L++ + KE++++ Q+YA+K + +S+A++
Sbjct: 3 YWSKRTLREREASIKKGFAEFKKELFJULYNLQLSQLRKELDAYIQKYANKNGLSVSDAKR 62

GY+ E + Mi +V £

GA+WS+ IW RQ++LR 1+ V A+++G+NGLTIAR IR+ D S + A+RLAITEHA

RVQ M ENGF F ++PE +ACD+CKDI K P + IG N+PP HPY
Sbjct: 229 RVQTIAQESIMKENGFEHFKLMPE3RACDICKDIGKETEKNPVKIADMEIGTNAPPIHPY 288

Query: 304 CRCAIVGVD 312
CRCA+V V+
Sbjct: 289 CRCAWEVE 297

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2669

A DNA sequence (GASx970) was identified in S.pyogenes <SEQ ID 7837> which encodes the amino acid 
sequence <SEQ ID 7838>. Analysis of this protein sequence reveals the following: 

Possible site: 15 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2091 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2670 

A DNA sequence (GASx971) was identified in S.pyogenes <SEQ ID 7839> which encodes the amino acid
sequence <SEQ ID 7840>. Analysis of this protein sequence reveals the following: 

Possible site: 28 

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2671

A DNA sequence (GASx972) was identified in S.pyogenes <SEQ ID 7841> which encodes the amino acid 
sequence <SEQ ID 7842>. Analysis of this protein sequence reveals the following: 

Possible site: 46 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3226 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2672 

A DNA sequence (GASx973) was identified in S.pyogenes <SEQ ID 7843> which encodes the amino acid 
sequence <SEQ ID 7844>. Analysis of this protein sequence reveals the following: 

Possible site: 29 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1830 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2673 

A DNA sequence (GASx975) was identified in S.pyogenes <SEQ ID 7845> which encodes the amino acid 
sequence <SEQ ID 7846>. Analysis of this protein sequence reveals the following: 

Possible site: 45 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4757 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB07248 GB:AP001519 unknown [Bacillus halodurans] 
Identities = 46/134 (34%) , Positives = 73/134 (54%) 

Query: 23 KQPQDEKKYTDMJVDAIIDKKFARWKSEQEAEKSEA^ 82 

K + E+ +T +V+ 1+ + A+ ++E EA+K+AKMN ++K +VE +KL E + 
Sbjct: 66 KPNKTERLFTQEEVNRIVTORLARALKDra 125 

Query: 83 ELKNDKTRNELTRVARQMFAESEINVND^^ 142 

+LK + R EL A +M E+ I +DDVL W DAEQT+ V T + K+ 
Sbjct: 126 QLKKAQIVKYELGREATMLGFAGIMADDDVLSFVVRDDAEQTQFAVKTFISLVDKIADMR 185 

Query: 143 RKALVRQTTPSTGG 156 

K ++ P G 
Sbjct: 186 MKEKLKGRPPKKDG 199 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2674 

A DNA sequence (GASx976) was identified in S.pyogenes <SEQ ID 7847> which encodes the amino acid 
sequence <SEQ ID 7848>. Analysis of this protein sequence reveals the following: 

Possible site: 24 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2478 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC79545 GB:U88974 ORF30 [Streptococcus thermophilus temperate 
bacteriophage O1205]
Identities = 43/119 (36%), Positives = 66/119 (55%), Gaps = 16/119 (13%)

Query: 9 SKEILHNLDYEAISVTIESNKIG KKWPAGTILAGKDKSIFEDRKQKVETVTNEE 63 

+ 1+ +L Y+A+S T+DS+ G KK + AGT++AG SIF+DR + V 
Sbjct: 9 TSNIVRSLPYKAVSATVDSSYPGVLvDGKKYIKaGTLVAGNGGSIFDDRTKSV 61 

Query: 64 VSTKEYVDGILLTDVDLTNGDAVGSCWRGTINRDKLADSSVAENYDDLEEVLPHIVFI 122 

V K +GI+L DVDLT + V S +Y G + DK+ + D +++ LP + FI 
Sbjct: 62 vFWKTEPEGIvLYDVDLTIDNTV-SVLYAGEVYKDKVNGGDIT DTVKKALPLVKFI 116 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2675 

A DNA sequence (GASx978) was identified in S.pyogenes <SEQ ID 7849> which encodes the amino acid 
sequence <SEQ ID 7850>. Analysis of this protein sequence reveals the following: 

Possible site: 60 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.4238 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC79546 GB:U88974 ORF31 [Streptococcus thermophilus temperate 
bacteriophage O1205] 
Identities = 195/343 (56%), Positives = 256/343 (73%), Gaps = 1/343 (0%) 

Query: 1 MALIHE 1 1 TSENI KGFYNAKNENVENTLGEKAFPPKQQLGLKLS FI KGAAGKPVTLKAAA 60 

M LI++ +T+ NI G++NA ENV +TLGE FP ++QLG KLS+IKGA+G+ V LKAAA 
Sbjct: 1 MGLIYDKVTASNIAGYFNALQENVSSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAA 60 

Query: 61 FDTKVPLRDRmVELIDEEMPFFKKWLVKFADRQQI^I^QTKNNELIDTILASIYNDQ 120 

FDT V +RDR++ E+ DE+MPFFKEAMLVKE DRQQLN++ + N L++TI+A I+ND 
Sbjct: 61 FDTNVTIRDRVSAEIffiDEQMPFFKEAMLVKENDRQQIjNLVKDSGNAVIjVNTIVAGIFNDN 120 

Query: 121 ATLIAGAKARLEAMRMEVLSKGKIHIQSNGVMKDIDYGLAEDQTTKPDAKWDSAGTATPL 180 

TL+ GA+ARLEAMRM+VL+ GKI S+GV KDIDYG+ D + W G ATPL 
Sbjct: 121 LTLWGARARLEAMRMQVIATGKIAFTSDGVNKDIDYGVKPDHKKQVSKSWAEPG-ATPL 179 

Query: 181 KDIEKAIEKMAERGFVPEAIII^SECTFSLIKNAESTDDVWP^PNGAAVTKFJ^iajTYLE 240 

D+E AIE E G PE +MN+KTF LI+ A ST+ V+KP+A +G+AVTK +L Y+ 
Sbjct: 180 ADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVTKAELENYIA 239 

Query: 241 DELQIKVILKDGMFVGDDGESRKYFPDGFATLVPNGNLGYTVFGTTPEQSDLLGGEATDA 300 

D + ++L-H-G + D GE K++PDG TL+PNG LG TVFGTTPE+SDL +A 
Sbjct: 240 DNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNWFGTTPEESDLFADNTVNA 299 

Query: 301 NVSIVETGIAITTTKTTDPVNVQTKVSMIALPSFERLEEVHII 343 

V IV+ GIA+TTTKTTDPVNVQTKVSM+ALPSFERL++V+++ 
Sbjct: 300 EVEIVDNGIAVTTTKTTDPVNVOTKVSMVALPSFERIjDDVYML 342 



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2676

A DNA sequence (GASx979) was identified in S.pyogenes <SEQ ID 7851> which encodes the amino acid 
sequence <SEQ ID 7852>. Analysis of this protein sequence reveals the following: 

Possible site: 46 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3319 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2677 

A DNA sequence (GASx980) was identified in S.pyogenes <SEQ ID 7853> which encodes the amino acid 
sequence <SEQ ID 7854>. Analysis of this protein sequence reveals the following: 

Possible site: 55

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2385 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAC34404 GB:AF158600 gp113 [Streptococcus thermophilus
bacteriophage Sfi11]
Identities = 53/109 (48%), Positives = 79/109 (71%), Gaps = 4/109 (3%)

Query: 11 IVKNVKLDLGIEDDNQDQLLEMLLNRITDHFKANYGVLEIDNAFSFVLEDCLIARFNRRG 70
+++NV 4-DL I DDN LL +LL RI +HFKA YGV E+D+ +F+ EDCL4- RFNRRG 
Sbjct: 9 VIQNVSVDLNINDDN LLGILLERIVNHFKAEYGVDEVDDNIAFIFEDCLVKRFNRRG 65 

Query: 71 SERAKTEBiVEGHKTTYYDHI^FEPYDAMIMAKIiNLIKDKSRKGGLYFL 119 
+E A++E ++GH +YYD+ NEF+PYD M+ +L ++++G + FL

Sbjct: 66 AEGARSESIDGHSMSYYDNENEFDPYDNMLQ-RLYGTSGQAKEGEVLFL 113 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2678

A DNA sequence (GASx981) was identified in S.pyogenes <SEQ ID 7855> which encodes the amino acid 
sequence <SEQ ID 7856>. Analysis of this protein sequence reveals the following: 

Possible site: 49 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.5714 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAA59188 GB:X84706 b3 [Bacteriophage B1]
Identities = 28/82 (34%), Positives = 49/82 (59%), Gaps = 2/82 (2%)

Query: 1 MRYADROTFVKTT -DEQYNPDLGEYTHTEVI S ITKPCFVMDMGMEKS VQI FGDYQKDRKV 59 

+RY D VTF+K + D Y+PDLGE+ E + D+G ++SV++FGD +K KV 

Sbjct: 1 LRYLDEVTFIKESPDSHYDPDLGEWVEKEPTRTVFSANITDIGTDRSVEVFGDIKKGAKV 60 

Query: 60 I YLKQPYT - KAFDYCEYEGRRY 80

+ + + +DY E++ +++ 
Sbjct: 61 MRMMPLFNMPKYDYIEFDNKKW 82 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2679 

A DNA sequence (GASx982) was identified in S.pyogenes <SEQ ID 7857> which encodes the amino acid 
sequence <SEQ ID 7858>. Analysis of this protein sequence reveals the following: 

Possible site: 14 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2509 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S. agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC34406 GB:AF158600 gp114 [Streptococcus thermophilus
bacteriophage Sfi11]
Identities = 44/103 (42%) , Positives = 65/103 (62%) , Gaps = 5/103 (4%) 

Query: 17 GLKKKLELIIKKBAVKK IVRDNGTQLQRKMINKAVFTKGYSTGATRRSITMQIGDGG 73 

GL + + ++K + +K ++R G++L+ +N+A F KGYSTGATRRS IT+Q+

Sbjct: 8 GLDEMAQSLLKNASPEKRSK^RKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDK 67 

Query: 74 LSVKVKPGTHYAGYLERGTRLMSKQPFVLPALKEQKVKFRKDL 116 
+V4 T Y+GYLE GTR M QPF+ PAL E K ++L 
Sbjct: 68 ATVEAL--TSYSGYLEVGTRKMEAQPFMKPALD3VAPKMVEEL 108

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2680 

A DNA sequence (GASx983) was identified in S.pyogenes <SEQ ID 7859> which encodes the amino acid
sequence <SEQ ID 7860>. Analysis of this protein sequence reveals the following: 

Possible site: 45 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3098 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MIKTRDQSIFDEMFKRIQSLGFKVYDYKPMTEVPYPFVEMESTDAEYIPNKDDIKGSVEL 60 

MIKTRDQS I FDE+FKRIQ+LG4- VYDYKPM EV YPFVE+E+T + NK DIKG+V L 
Sbjct: 1 MIKTRDQSIFDELFI<RIQALGYTVYDYKP^INEVGYPFVELENTQTIHEANKTDIKGTVSt 1 60 

Query: 61 MLS VWGVQKKRKQVSDMASAI FSQALTVES SDVFRVJSLNTRQSS IQMLDDTTTVTPLKRA 120 

LSVWG+QKKRK+VSDMAS IF+QAL + ++D + W+LN++ S+IQMLDDTTT TPLKRA 
Sbjct: 61 SLSWGLQKKRKEVSD^SNIFNQALNISATDGYSMALNSQASTIQMLDDTTTHTPLKRA 120 

Query: 121 IVTLRFNLR 129 

++ L F LR 
Sbjct: 121 LINLEFRLR 129 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2681 

A DNA sequence (GASx984R) was identified in S.pyogenes <SEQ ID 7861> which encodes the amino acid
sequence <SEQ ID 7862>. Analysis of this protein sequence reveals the following: 
Possible site: 36 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1736 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2682 

A DNA sequence (GASx985) was identified in S.pyogenes <SEQ ID 7863> which encodes the amino acid 
sequence <SEQ ID 7864>. Analysis of this protein sequence reveals the following: 

Possible site: 27 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAA32613 GB:L31366 structural protein [Bacteriophage Tuc2009] 
Identities = 81/185 (43%), Positives = 111/185 (59%), Gaps = 22/185 (11%) 



+++KA ++GE++++W+ID E

+GKY A Y + Y++SFS + N+E+ +E+ +EF + G QKG ATLT+ Q VQY FK

DTVAG 165

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2683

A DNA sequence (GASx986) was identified in S.pyogenes <SEQ ID 7865> which encodes the amino acid 
sequence <SEQ ID 7866>. Analysis of this protein sequence reveals the following: 
Possible site: 55 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2273 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MQbEIKGKTHNVKFGTRFVAEMDKNHIAERQGFKFGAGLQSSV-PFLIDHSWTLAEVIY 59 
M+L IKGK + KFG +FV E+DKN + E+ G FG L + P L ++ TL+ V++ 

Query: 60 TGTITEPPRPSLNDIYDYIDEVEDIEKLFDDVLDELRQSNASKLFMAQVEK 110 

G TE P+ S DI D+IDE EDIEKLFDDVL E+ +SN KL A++ K 
Sbjct: 61 LGNRTETPKLSQGDIDDFIDECEDIEKLFDDVLKEITESNTGKLIKAKMTK 111 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2684 

A DNA sequence (GASx987) was identified in S.pyogenes <SEQ ID 7867> which encodes the amino acid 
sequence <SEQ ID 7868>. Analysis of this protein sequence reveals the following: 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2735 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAA59193 GB:X84706 c2 [Bacteriophage B1]
Identities = 40/111 (36%), Positives = 57/111 (51%), Gaps = 10/111 (9%)

Query: 2 IVLNCIRYLGMTDlNEIGRLTLYEYDLLMTGKAIiUWDESHKAHKQAWINHQVTATKLVG 61

+++ +R G+ D+-+ R+T+ EY + L +DE ++QAW N QV ATK G 

Sbjct: 15 MMIRFLRCFGIQDLSVFERMTIREY3IRSIAFQLRTLDEEEFIYEQAWANWQVQATKQQG 74 

Query: 62 GKKNKKE VP VYKKFKDFFD YEEE I RKI - TQEIDEGYDKKGMDLLLKAN 108 

K P+Y FK FFD E EI I + E D K +DL+ KAN

Sbjct: 75 KK PLYPTFKKFFDKKKLENEILGIESPENKFKKDNKLIDLMKKAN 119 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2685

A DNA sequence (GASx989) was identified in S.pyogenes <SEQ ID 7869> which encodes the amino acid 
sequence <SEQ ID 7870>. Analysis of this protein sequence reveals the following: 
Possible site: 60 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2869 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAA66560 GB:X97918 gene 19.1 [Bacteriophage SPP1]
Identities = 66/232 (28%) , Positives = 106/232 (45%) , Gaps = 12/232 (5%) 



V GR V +E ++ G +G ++ R+LE+ A + G ++ +R +

+ L F C +P K

Query: 152 DVKTKKGTSIQNGGLFQTKPIITLNLSSPTKEIKLLHVESQKYIRLT GTYTTDEIK 207
DV T T ++N GL T P I S+ E ++ ++ ++ G T D +

+T NG++I+ L +

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2686 

A DNA sequence (GASx990) was identified in S.pyogenes <SEQ ID 7871> which encodes the amino acid 
sequence <SEQ ID 7872>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2861 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB04681 GB:AP001510 unknown conserved protein in others
[Bacillus halodurans]
Identities = 116/449 (25%), Positives = 198/449 (43%), Gaps = 79/449 (17%)

Query: 2 IYLFDKLERLVATVG-TDDLLSWHFKVKNMDl'JDQASFEVPVDYDVEPFVyFGFFNYDPHQ 60
+++FD+ ++L+ T+ + L+ F+ + N F ++ E + + HQ
Sbjct: 4 U i KDQ r M PI ^T^bVHALFREELNRVPNQPFAFTIEASSEEAKHV IEEHQ 59

h Y+S A+ 1+ + +F TF N+IT

--LFEHIS-- 319

L G+4+R+E +E+E N K
- - LNGMDERLERI IEEIEKNQGK 393

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2687 

A DNA sequence (GASx991) was identified in S.pyogenes <SEQ ID 7873> which encodes the amino acid 
sequence <SEQ ID 7874>. Analysis of this protein sequence reveals the following: 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2584 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAA98101 GB:M19348 hyaluronidase [Streptococcus pyogenes phage 
H4489A] 

Identities = 314/371 (84%), Positives = 338/371 (90%), Gaps = 1/371 (0%) 

Query: 1 MAENIPLRVQFKRMKAAEWASSDWLLEC-EIGFETDTGFAKFGDGQNTFSKLKYLTGPKG 60
M ENIPLRVQFKRM A EWA SDV+LLEGEIGFETDTGFAKFGDGQNTFSKLKYLTGPKG
Sbjct: 1 MTENIPLRVQFKRMSADEWARSDVILLEGEIGFETDTGFAKFGDGQNTFSKLKYLTGPKG 60

Query: 61 PKGDTGLQGKTGGTGSRGPAGKPGTTDYDQLQNKPDLGAFAQKEETNSKITKLESSKADK 120
PKGDTGLQGKTGGTG RGPAGKPGTTDYEQLQNKPDLGAFAQKEETNSKITKLESSKADK
Sbjct: 61 PKGDTGLQGKTGGTGPRGPAGKPGTTDYDQLQNKPDLGAFAQKEETNSKITKLESSKADK 120

Query: 121 NAVYLKAESNAKLDEKLNLKGGVMTGQLQFKPN-SGIKPSSSVGGAINIDMSKSEGAAMV 179 
+AVY KAES +LD+KL+L GG++TGQLQFKPN SGIKPSSSVGGAINIDMSKSEGAAMV 

Query: 180 MYTNKDTTDGPLMILRSNKDTFDQSVQFVDYKGTraAVNIVMRQPTTPNFSSALNITSAN 239 

MYTNKDTTDGPLMILRS+KDTFDQS QFVDY G TNAVNIVMRQP+ PNFSSALNITSAN 
Sbjct: 181 MYTNKDTTDGPLMILRSDKDTFDQSAQFVDYSGIOTIAVNIvMRQPSAPNFSSALNITSAN 240 

Query: 240 EGGSAMQIRGVEKALGTLKITHENPSVDKEYDKNAAALSIDIVKKQKGGKGTAAQGIYIN 299 

EGGSAMQIRGVEKALGTLKITHENP+V+ +YD+NAAALSIDIVKKQKGGKGTAAQGIYIN 
Sbjct: 241 EGGSAMQIRGVEKALGTLKITHENPNVEAKYDENAAALSIDIVKKQKGGKGTAAQGIYIN 300 

Query: 300 STSGTTGKLLRIRNLNDDKFYVKPDGGFYAKETSQIDGNLKLKDPIANDHAATKAYVDGE 359 

STSGT GK+LRIRN N+DKFYV PDGGF++ S + GNL +KDP + HAATK YVD + 
Sbjct: 301 STSGTAGKMLRIRWKNEDKFYVGPDGGFHSGANSTVAGNLTVKDPTSGKHAATKDYVDEK 360 

Query: 360 VEKLKALIAAK 370 

+ +LK L+ K 
Sbjct: 361 IAELKKLILKK 371 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2688 

A DNA sequence (GASx993) was identified in S.pyogenes <SEQ ID 7875> which encodes the amino acid 
sequence <SEQ ID 7876>. Analysis of this protein sequence reveals the following: 

Possible site: 29 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1358 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2689 

A DNA sequence (GASx995) was identified in S.pyogenes <SEQ ID 7877> which encodes the amino acid 
sequence <SEQ ID 7878>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have no N-terminal signal sequence 



Final Results 

bacterial cytoplasm --- Certainty=0.0855 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC34418 GB:AF158600 gp149 [Streptococcus thermophilus 
bacteriophage Sfi11] 
Identities = 27/95 (28%) , Positives = 50/95 (52%) , Gaps = 2/95 (2%) 

Query: 9 KYPQLDGTGAVASTHI I IAAEDGAVI PQLIKQDLTSTNDTE 1 1 KAALEEFKKSEYVE I AM E8 

K + D +GA +T +1+ DGA +P + + ++TE++K ALE + + + A 
Sbjct: 26 KSKEYDASGAAYATKVILKNRDGAYVPVFLPVEK1DLSNTELLKEALEVIYQENFPQRAE B5 

Query: 69 GEAVQKVDDLEKISQETARTAKTAQTAAGLAKVSA 103 

E ++D EKI + A+K +TA+ + S+ 
Sbjct: 86 NEKFNELD- -EKIKEYEALSKKATETIAKMEEASS 118 
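
The statistics line that heads an alignment can be checked against its own counts. The sketch below is illustrative only: the function names are hypothetical, and it assumes that each percentage is the ratio truncated to a whole number, an assumption that happens to reproduce the three values reported for this alignment (27/95, 50/95 and 2/95).

```python
import re

# Matches "Identities = 27/95 (28%)" and the Positives / Gaps equivalents.
STAT_RE = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)\s*\((\d+)%\)"
)

def parse_alignment_stats(line):
    """Return {name: (count, aligned_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STAT_RE.findall(line)
    }

def truncated_percent(count, length):
    """Percentage truncated to an integer (assumed convention)."""
    return count * 100 // length

line = "Identities = 27/95 (28%) , Positives = 50/95 (52%) , Gaps = 2/95 (2%)"
stats = parse_alignment_stats(line)
for name, (count, length, reported) in stats.items():
    print(name, count, length, reported, truncated_percent(count, length))
```

A mismatch between the reported and recomputed percentage on other alignments would usually point to a transcription error in one of the numbers rather than to a different convention.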

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2690 

A DNA sequence (GASx996) was identified in S.pyogenes <SEQ ID 7879> which encodes the amino acid 
sequence <SEQ ID 7880>. Analysis of this protein sequence reveals the following: 

Possible site: 22 



Final Results 

bacterial membrane --- Certainty=0.2848 (Affirmative) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2691 

A DNA sequence (GASx997) was identified in S.pyogenes <SEQ ID 7881> which encodes the amino acid 
sequence <SEQ ID 7882>. Analysis of this protein sequence reveals the following: 

Possible site: 41 

>>> Seems to have no N-terminal signal sequence 

38 - 54 ( 35 - 55) 



Final Results 

bacterial membrane --- Certainty=0.2466 (Affirmative) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2692 

A DNA sequence (GASx998R) was identified in S.pyogenes <SEQ ID 7883> which encodes the amino acid 
sequence <SEQ ID 7884>. Analysis of this protein sequence reveals the following: 
Possible site: 27 

>>> Seems to have no N-terminal signal sequence 

INTEGRAL Likelihood = -9.87 Transmembrane 47 - 63 ( 41 - 72) 

Final Results 

bacterial membrane --- Certainty=0.4949 (Affirmative) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 
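
The INTEGRAL line reported for this protein carries a likelihood score and two residue ranges. A minimal sketch of how such a line might be parsed is given below; it assumes the first range is the predicted membrane-spanning core and the range in parentheses is an extended region, and the field names are hypothetical labels chosen for illustration.

```python
import re

# Matches e.g. "INTEGRAL Likelihood = -9.87 Transmembrane 47 - 63 ( 41 - 72)"
TM_RE = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?\d+(?:\.\d+)?)\s+Transmembrane\s+"
    r"(\d+)\s*-\s*(\d+)\s*\(\s*(\d+)\s*-\s*(\d+)\s*\)"
)

def parse_transmembrane(line):
    """Return the fields of one INTEGRAL line, or None if it does not match."""
    match = TM_RE.search(line)
    if match is None:
        return None
    likelihood = float(match.group(1))
    core_start, core_end, ext_start, ext_end = map(int, match.groups()[1:])
    return {
        "likelihood": likelihood,
        "core": (core_start, core_end),
        "extended": (ext_start, ext_end),
        # Residue ranges are inclusive, so add one to the difference.
        "core_length": core_end - core_start + 1,
    }

segment = parse_transmembrane(
    "INTEGRAL Likelihood = -9.87 Transmembrane 47 - 63 ( 41 - 72)"
)
print(segment)
```

For the line above the core range 47 - 63 spans 17 residues, and the parenthesised range 41 - 72 encloses it.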

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2693 

A DNA sequence (GASx999) was identified in S.pyogenes <SEQ ID 7885> which encodes the amino acid 
sequence <SEQ ID 7886>. Analysis of this protein sequence reveals the following: 

Possible site: 24 

>>> Seems to have a cleavable N-term signal seq. 

Final Results 

bacterial outside --- Certainty=0.3000 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2694 

A DNA sequence (GASx1001) was identified in S.pyogenes <SEQ ID 7887> which encodes the amino acid 
sequence <SEQ ID 7888>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have an uncleavable N-term signal seq 

INTEGRAL Likelihood =-10.51 Transmembrane 18 - 34 ( 16 - 34) 

Final Results 

bacterial membrane --- Certainty=0.5203 (Affirmative) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2695 

A DNA sequence (GASx1002) was identified in S.pyogenes <SEQ ID 7889> which encodes the amino acid 
sequence <SEQ ID 7890>. Analysis of this protein sequence reveals the following: 

Possible site: 32 

>>> Seems to have an uncleavable N-term signal seq 

INTEGRAL Likelihood = -3.61 Transmembrane 12 - 28 ( 11 - 33) 

Final Results 

bacterial membrane --- Certainty=0.2444 (Affirmative) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein is similar to AF186180 from S.equi. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2696 

A DNA sequence (GASx1003) was identified in S.pyogenes <SEQ ID 7891> which encodes the amino acid 
sequence <SEQ ID 7892>. Analysis of this protein sequence reveals the following: 

Possible site: 32 

>>> Seems to have a cleavable N-term signal seq. 

Final Results 

bacterial outside --- Certainty=0.3000 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 
The protein is similar to SeeH from S.equi: 

>GP:AAF72809 GB:AF186180 SeeH [Streptococcus equi] Length = 

Identities = 233/236 (98%) , Positives = 234/236 (98%) 

Sbjct: 

Query: 61 NSPDIVTSHMLKYSVKDKNLSVFFEKDWISQEFKDKEVDIYALSAQEVCKCPGKRYEAF6 120 

NSPDIOTSHMLKYSVKDKNLSVFFEKDWISQEFKDKEVDIYALSAQE CECPGKRYEAFG 
Sbjct: 61 NSPDIVTSHMLKYSVKDKNLSVFFEKDWISQEFKDKEVDIYALSAQEACECPGKRYEAFG 120 

Query: 121 GITLTNSEKKEIKA7PVNVWDKSKQQPPMFITVNKPKATIAQEVDIKVRKLLIKKYDIYMNR 180 

GITLTNSEKKEIKVP+NVWDKSKQ PPMFITVNKPKVTAQEVDIKVRKLLIKKYDIYNNR 
Sbjct: 121 GITLTNSEKKEIKVPINVWDKSKQHPPMFITWKPKOTAQEVDIKVRKLLIKKYDIYNNR 180 

Query: 181 EQKYSKGTVTLDLNSGKDIVFDLYYFGNGDFNSMLKIYSNNERIDSTQFHVDVSIS 236 

EQKYSKGTVTLDLNSGKDI VFDLYYFGNGDFNSMLKI YSNNERIDSTQFHVDVS I S 
Sbjct: 181 EQKYSKGTVTLDLNSGKDI VFDLYYFGNGDFNSMLKI YSNNERIDSTQFHVDVS IS 236 

There is also homology to a S. aureus enterotoxin: 

: [Staphylococcus aureus] 
s = 108/215 (49%) , Gaps = 19/215 (8%) 

Query: 27 SNWQANSYNTTNRHNLESLYKHDSNLIEADSI - KNSPDIVTSHMLKYSVKDKNLSVFFE 85 

+++ AN+Y N ++ KD EDI+ND +K++ D 
Sbjct: 34 TDIALANAYGQYNHPFIKENIKSDEISGEKDLIFRNQGDSGNDLRVKFATAD 85 

Query: 86 KDWISQEFKDKEVDIYALSAQEVCECPGKRYEA--FGGITLTNSEK- -KEIKVPVNVWDK 141 

++Q+FK+K VDIY S CE + +GG TL NSEK +E + NVW 

Sbjct: 86 LAQKFKNKNVDIYGASFYYKCEKISENISECLYGGTTL-NSEKIAQERVIGANVWVD 141 

Query: 142 SKQQPPMFITWKPKVTAQEVDIKVRKLLIKKYDIYNNREQOSKGTVTLDLNSGKDIVF 201 

Q+ I NK VT QE+DIK+RK+L KY II ++ + SKG + D+ + +D F 
Sbjct: 142 GIQKETELIRTNKKNVTLQELDIKIRKILSDKYKIY-YKDSEISKGLIEFDMKTPRDYSF 200 

Query: 202 DLYYFGNGDFNSMLKIYSNNERIDSTQF-HVDVSI 235 

D+Y + + KIY +N+ + S H+DV++ 

Sbjct: 201 DIYDLKGENDYEIDKIYEDNKTLKSDDISHIDVNL 235 

>GP:AAC26661 GB:AF064774 extracellular enterotoxin type I precursor 
[Staphylococcus aureus] 
Identities = 68/214 (31%) , Positives = 109/214 (50%) , Gaps = 27/214 (12%) 

Query: 42 NLESLY-KHDSKLIEADSIKNSPDIVTSHMLKYSVKDKNLSVFFEKDWIS-QEFKDKEVD 99 

Ml + Y KHD ++ + KN P ++ L++S +L + +W +FK K+4-D 
Sbjct: 32 NLRNFYTKHDYIDLKGVTDKNLP IANQLEFSTGTNDL- ISESNNWDEISKFKGKKLD 87 

Query: 100 IYALSAQEVCECPGKRYEAFGGITLTNSEKKEI -KVPVNVWDKSKQQPPMF- - ITVNKPK 156 

1+ + C K +GG TL+ K+P+N+W K + I NK 

Sbjct: 88 IFGIDYNGPC KSK^MYGGATLSGQYMSARKIPINLVfVNGKHKTISTDKIATNKKL 143 

Query: 157 VTAQEVDIKVRKLLIKKYDIYNNRE QKYSKGTVTLDLNSGKDIVFD 202 

VTAQE+D+K+R+ L ++Y+IY + ++ G V LN+ K +D 

Sbjct: 144 OTAQEIDVHL.RRYIQEEYNIYGmeraSKGKEYGYKSKFYSGFNNGKOTjFHLNNEKSFSYD 203 

Query: 203 LYYFGNGDFNSMLKIYSNNERIDSTQFHVDVSIS 236 

L+Y G+G S LKIY +N+ I+S +FH+DV IS 
Sbjct: 204 LFYTGDGLPVSFLKIYEDNKIIESEKFHLDVEIS 237 



>GP:AAC28968 GB:U93688 enterotoxin [Staphylococcus aureus] 
Identities = 70/244 (28%) , Positives = 127/244 (51%) , Gaps = 27/244 (11%) 

Query: 12 KKIYSMIICLSFLLYSNWQANSYOTTNRHMjESLYKHDSNLIEADSIKNSPDIVTSHML 71 

Sbjct: 2 KKLISILL-1NIIILC3VSHNASAQGDIGIDMLENFYTK-KDFVDLKDVKDN-DTPIANQL 58 

Query: 72 KYSVKDKNLSVFFEECDWIS-QEFKDKEVDIYALSAQEVCECPGKRYEAFGGITLTNSE-K 129 

++S + +L + KD+ FK K++D++ +S C +Y +GG+T TW 
Sbjct: 59 QFSNESYDL- ISESKDFNKFSHFKGKKIDVFGISYNGQCNT- --KY- IYGGVTATNEYLD 113 

Query: 130 KEIKVPVNVW- -DKSKQQPPMFITVNKPKVTAQEVDIKVRKLLIKKYDIYNNREQK 183 

K +P+N+W K ++ NK VTAQE+D+K+RK L ++Y+IY + K 
Sbjct: 114 173 

Query: 184 YSKGTVTLDLNSGKDIVFDLYYFG-NGDFNSMLKIYSNNERIDSTQFHVD 232 

++ G VT LN+ +DL+Y G +G S LKIY +N+ ++S +FH4-D 
Sbjct: 174 YGHKSKFYSGFNIGKVTFHLNNNDTFSYDLFYTGDDGLFKSFLKIYEDNKTVESEKFHLD 233 

Query: 233 VSIS 236 

V IS 
Sbjct: 234 VDIS 237 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2697 

A DNA sequence (GASx1004R) was identified in S.pyogenes <SEQ ID 7893> which encodes the amino 
acid sequence <SEQ ID 7894>. Analysis of this protein sequence reveals the following: 

Possible site: 29 

>>> Seems to have an uncleavable N-term signal seq 

INTEGRAL Likelihood = -2.18 Transmembrane 12 - 28 ( 12 - 28) 

Final Results 

bacterial membrane --- Certainty=0.1871 (Affirmative) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2698 

A DNA sequence (GASx1009) was identified in S.pyogenes <SEQ ID 7895> which encodes the amino acid 
sequence <SEQ ID 7896>. Analysis of this protein sequence reveals the following: 

Possible site: 34 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.6391 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2699 

A DNA sequence (GASx1011) was identified in S.pyogenes <SEQ ID 7897> which encodes the amino acid 
sequence <SEQ ID 7898>. Analysis of this protein sequence reveals the following: 

Possible site: 50 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.4528 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2700 

A DNA sequence (GASx1024) was identified in S.pyogenes <SEQ ID 7899> which encodes the amino acid 
sequence <SEQ ID 7900>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have an uncleavable N-term signal seq 

Final Results 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2701 

A DNA sequence (GASx1033) was identified in S.pyogenes <SEQ ID 7901> which encodes the amino acid 
sequence <SEQ ID 7902>. Analysis of this protein sequence reveals the following: 

Possible site: 20 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1652 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2702 

A DNA sequence (GASx1039) was identified in S.pyogenes <SEQ ID 7903> which encodes the amino acid 
sequence <SEQ ID 7904>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have no N-terminal signal sequence 

INTEGRAL Likelihood = -1.06 Transmembrane 15 - 31 ( 15 - 31) 

Final Results 

bacterial membrane --- Certainty=0.1426 (Affirmative) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2703 

A DNA sequence (GASx1058) was identified in S.pyogenes <SEQ ID 7905> which encodes the amino acid 
sequence <SEQ ID 7906>. Analysis of this protein sequence reveals the following: 

Possible site: 60 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.5484 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2704 

A DNA sequence (GASx1077) was identified in S.pyogenes <SEQ ID 7907> which encodes the amino acid 
sequence <SEQ ID 7908>. Analysis of this protein sequence reveals the following: 

Possible site: 31 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.4848 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2705 

A DNA sequence (GASx1080) was identified in S.pyogenes <SEQ ID 7909> which encodes the amino acid 
sequence <SEQ ID 7910>. Analysis of this protein sequence reveals the following: 

Possible site: 40 



>>> Seems to have an uncleavable N-term signal seq 
INTEGRAL Likelihood =-12 Transmembrane 107 - 123 ( 93-1 
INTEGRAL Likelihood =-11 Transmembrane 20 - 35 ( 14 - 
INTEGRAL Likelihood = -8 Transmembrane 226 - 242 (218-2 
INTEGRAL Likelihood = -5 Transmembrane 129 - 145 (126-1 
INTEGRAL Likelihood = -4 Transmembrane 160 - 176 (159-1 
INTEGRAL Likelihood = -1 Transmembrane 55 - 71 ( 55 - 



Final Results 

bacterial membrane --- Certainty=0.5967 (Affirmative) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2706 

A DNA sequence (GASx1081) was identified in S.pyogenes <SEQ ID 7911> which encodes the amino acid 
sequence <SEQ ID 7912>. Analysis of this protein sequence reveals the following: 



Possible site: 34 

>>> Seems to have a cleavable N-term signal seq. 
INTEGRAL Likelihood =-13 Transmembrane 103 - 119 
INTEGRAL Likelihood =-11 Transmembrane 208 - 224 
INTEGRAL Likelihood = -8 Transmembrane 54 - 70 
INTEGRAL Likelihood = -5 Transmembrane 160 - 176 
INTEGRAL Likelihood = -4 Transmembrane 127 - 143 

Final Results 

bacterial membrane --- Certainty=0.6201 (Affirmative) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2707 

A DNA sequence (GASx1089) was identified in S.pyogenes <SEQ ID 7913> which encodes the amino acid 
sequence <SEQ ID 7914>. Analysis of this protein sequence reveals the following: 

Possible site: 37 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.2999 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2708 

A DNA sequence (GASx1109) was identified in S.pyogenes <SEQ ID 7915> which encodes the amino acid 
sequence <SEQ ID 7916>. Analysis of this protein sequence reveals the following: 

Possible site: 27 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1270 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2709 

A DNA sequence (GASx1114R) was identified in S.pyogenes <SEQ ID 7917> which encodes the amino 
acid sequence <SEQ ID 7918>. Analysis of this protein sequence reveals the following: 

Possible site: 19 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.4021 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2710 

A DNA sequence (GASx1149) was identified in S.pyogenes <SEQ ID 7919> which encodes the amino acid 
sequence <SEQ ID 7920>. Analysis of this protein sequence reveals the following: 

Possible site: 28 

n signal seq 

12 - 28 ( 12 - 29) 



Final Results 

bacterial membrane --- Certainty=0.1680 (Affirmative) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2711 

A DNA sequence (GASx1150) was identified in S.pyogenes <SEQ ID 7921> which encodes the amino acid 
sequence <SEQ ID 7922>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have a cleavable N-term signal seq. 

Final Results 

bacterial outside --- Certainty=0.3000 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2712 

A DNA sequence (GASx1160) was identified in S.pyogenes <SEQ ID 7923> which encodes the amino acid 
sequence <SEQ ID 7924>. Analysis of this protein sequence reveals the following: 

Possible site: 17 

>>> Seems to have no N-terminal signal sequence 

15 - 31 ( 15 - 31) 

Final Results 

bacterial membrane --- Certainty=0.2275 (Affirmative) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2713 

A DNA sequence (GASx1167) was identified in S.pyogenes <SEQ ID 7925> which encodes the amino acid 
sequence <SEQ ID 7926>. Analysis of this protein sequence reveals the following: 

Possible site: 16 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1404 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAB99233 GB:U67563 oxaloacetate decarboxylase alpha chain (oadA) 
[Methanococcus jannaschii] 
Identities = 250/453 (55%), Positives = 325/453 (71%), Gaps = 7/453 (1%) 

Query: 13 VAITETVLRDGHQSLMATRLSIEDMLPVLTILDKIGYYSLECWGGATFDACIRFRIEDPW 72 

V I +T RD QSL+ATR+ EDMLP+ +D++G+YS+E WGGATFDACIR+LNEDPW 
Sbjct: VKIVDTTFRDAQQSLIATRmTEDMrjPIAEKMDEVGFYSMEVWGGATFDACIRYENEDPW 61 

Query: 73 ERLRTLKKGLPNTRLQMLLRGQNLLGYRHyADDIVDKFISLSAQNGIDVFRIFDALNDPR 132 

ERLR LKK + NT tiQMLLRGQNL+GYRHY DDIV+KF+ + +NGID+FRIFDALND R 
Sbjct: 62 ERLRALKKRIQNTPLQMLLRGQNLVGYRHYPDDIVEKFVIKAHENGIDIFRIFDALNDVR 121 



Query: 133 NIQCALRAVKKTGI<EaQLCIAYTTSPVIiTLNYyLSLVI<ELVEMGADSICIKDMAGILTPK 192 

N++ A++ KKGBQ 1 YT SPVHT++ Y+ L K+L EMG DS I CI KDMAG+LTP 
Sbjct: 122 N^TAIKTAKKVGAEVQGAICYTISPVHTIDQYTOIAKKLEEMGCDSICIKDMAGLLTPY 181 

Query: 193 AAKELVSGIKAMTNDPLIVHTHATSGISQMTYLAAVEAGADRIDTALSPFSEGTSQPATE 252 

ELV +K +LP+ VH+H TSG++ MTYIj +EAGAD +D A+SPF+ GTSQP TE 
Sbjct: 182 EGYELVKRLKEEISLPIDVHSHCTSGLAPMTYLKVIEAGADMVDCAISPFAMGTSQPPTE 241 



Query: 253 SMYLALKEASYDITLDETLLEQAANHLRQARQKYLADGILDPSLLFPDPRTLQYQVPGGM 312 

S+ +ALK YD LD LL + ++ + R+KY + P D R L YQVPGGM 

Sbjct: 242 SIWALKGTKYDTGLDLKLLNEIRDYFMKVREKYI04--LFSPISQIVDARVLVYQVPGGM 299 

Query: 313 LSNMLSQLKQANAESKLEEVI J AEVPRWKDIX3YPPLVTPLSQ^IVGTQAAMNVILGKPYQM 372 

LSN++SQLK+ A K EEVL E+PRVRKDLGYPPLVTP SQ+VGTQA +NV+ + Y++ 
Sbjct: 300 LSNLVSQLKEQGALDKFEEVLQEIPRWKDLGYPPLWPTSQIVGTQAVIjNVLTEERYKI 359 

Query: 373 VSKEIKQYLAGDYGKTPAPVNEDLKRSQI - -GSAPVTTNRPADQLSPEFEVLK- -AEVAD 428 

++ E+ Y+ G YGK PAP+N +L + + G P+T RPAD L PE+E +K AE 
Sbjct: 360 ITNEVWYVKGFYGKPPAPINPELLKRVLDEGEKPITC-RPADLLPPEWEKVKKEAEEKG 418 

Query: 429 LAQTDEDVLTYALFPSVAKPFLTTKYQTDDVIK 461 

+ + 4ED+LTYAL+P +A FL + + + + K 
Sbjct: 419 IVKKEEDILTYALYPQIAVKFLRGELKfiEPIPK 451 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2714 

A DNA sequence (GASxl 168) was identified in S.pyogenes <SEQ ID 7927> which encodes the amino acid 
sequence <SEQ ID 7928>. Analysis of this protein sequence reveals the following: 

Possible site: 38 

N-term signal seq 

Final Results 

bacterial membrane --- Certainty=0.3845 (Affirmative) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2715 

A DNA sequence (GASx1170) was identified in S.pyogenes <SEQ ID 7929> which encodes the amino acid 
sequence <SEQ ID 7930>. Analysis of this protein sequence reveals the following: 
Possible site: 51 



>>> Seems to have no N-terminal signal sequence 
INTEGRAL 
INTEGRAL 



Likelihood 
Likelihood = -5 
Likelihood = -5 
Likelihood = -4 
Likelihood = -4 
Likelihood = -3 
Likelihood = -2 
Likelihood = -0 
Likelihood = -0 



Transmembrane 211 - 

Transmembrane 117 - 

Transmembrane 256 - 

Transmembrane 44 - 

Transmembrane 287 - 

Transmembrane 358 - 

Transmembrane 20 - 



53 



Transmembrane 165 - 

Final Results 

bacterial membrane --- Certainty=0.3824 (Affirmative) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ> 



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAA05140 GB:AJ002015 methylmalonyl-CoA decarboxylase, 
beta-subunit [Propionigenium modestum] 
Identities = 231/395 (58%), Positives = 293/395 (73%), Gaps = 19/395 (4%) 

Query: 1 MLDVLNQWQSSGIJffiLTVMNLIMICLASFFLYIXSIKKEYEPYLIWPIAFGILLVNLP^ 60 
ML + S+G L 4 ++IM+ +A FLYL I KE+EP L+VPI+FGILL NLP A 
Sbjct: 1 MLQAILDFYHSTGFhTGiaiMGSIIMMLvACVFLYLAIAKEFEPLLLVPISFGILLTNLPFA 60 

Query: 61 
Sbjct: 61 

Query: 103 
PLI+NPK++LLG ARQ GIF+ FF AI G+ T QEAAS+GIIGGADGPTAI++++KLAP 
Sbjct: 121 PLISNPKSLLLGAAAQFGI FVTFFGAIASGLFTAQEAASIGI 1GGADGPTAI FLSSKIAP 180 

Query: 162 DLLSTIAIAAYSYMALVPI IQPPIIKLLTTKAERQVKMTQARTVSQKEKI I FPIMVTIFV 221 

L+ IA+AAYSYMALVPIIQPPI+ LT++ ER++KM+Q R VS++EKIIFPI+VTI V 
Sbjct: 181 HLMGPIAVAAYSVMALVPIIQPPIMTALTSETERKIKMSQLRLVSKREKIIFPIVVTILV 240 

Query: 222 SLLVPSATTLVGCLMLGNLVREIKrVPKIVENLQQWMFCITIILGLTVGAKANGDLFLS 281 

SL+VP A TLVG LMLGNL RE +V ++ + + ++ ITI LG+TVGA A + FL 
Sbjct: 241 SLIVPPAATDVGMLMLGNLFRECGVVGFBEDTAKNALINI ITI FLGVTVGATATAEAFLK 300 

Query: 282 ATTLKIIALGLIAFARGTAGGVLMGKVMYYLSGNKVNPMIGAAGVSAVPMftARWQKIGQ 341 

TL 1+ LG++AF GT GVL+ KM LS +NP++G4-AGVSAVPMAARV Q +GQ 
Sbjct: 301 VETIAILGLGIVAFGIGTGSGVLLAKFMNKLSKEPINPLLGSAGVSAVPMAARVSQWGQ 360 

Query: 342 EEDPSNFLLMHAMGPNVAGVIGSAIASGALLAFFG 376 

+ DP+NFLLMHAMGPNVAGVIGSA+++G LL+ FG 
Sbjct: 361 KADPTNFLLMHAMGPNVAGVIGSAVSAGVLLSLFG 395 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2716 

A DNA sequence (GASx1171R) was identified in S.pyogenes <SEQ ID 7931> which encodes the amino 
acid sequence <SEQ ID 7932>. Analysis of this protein sequence reveals the following: 

Possible site: 16 

>» Seems to have no N-terminal signal sequence 



Final Results 

bacterial cytoplasm --- Certainty=0.0851 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 9 ISQIALKALLYEVSLSPKPGLVDRFDNGAHDDMSFITFIDSMIALSPFFQAYIETGFAYA 68 

+ LA A++ EV L+PKPGLVD +NGAH DM TFI S A++P+ +++ G+ A 
Sbjct: 32 VGHIAYHAMMLEVHLTPKPGLVDTANNGAHRDMDLNTFIASAEAIAPYLHEFVSAGWESA 91 

Query: 69 KEEPDLLFNRLRQLGQKAEETMFCATQGINTHKGI^FSMAIjLLGATGAYLARTPHLMTDL 128 

L + LR +G +AE+ MF ATQG+NTHKG+ F + L+ G+ G A 
Sbjct: 92 GNPAAQIiLSALRPIGIEAEQAMFAATQGVNTHKGMIFILGLICGSVGWLKANQ 144 



Query: 129 GRFSKEDTIAICRLVKPMTAHLIQTDLGHLNTKKEFTYGEQLFVTYGIKGPRGEASEGFT 188 

K D I ++ L+ +L + T GE+++ YG+ G RGEA+ G 

Sbjct: 145 LKIDAQHIGETIRQACQFLVIDELKAKRDCEPETAGERIYRQYGLTGARGEAASGLA 201 

Query: 189 TLTDHALPYFRQMISQN-DPETSQLRLLVYLMSIVEDGNLIHRGGIEAWKGVKAD-MRLL 246 

+ HALP ++ +++ E + L+ LM+ D NL+ RGG+ V+ +LL 

Sbjct: 202 MVMQHALPAYQACLTKGASTEQALWHTLLVL^IANNNDSNLVSRGGIAGLHFVQEQAQQLL 261 



Query: 247 LQQDIjSTTDLRLALSSYNQCLINQHLSPGGAADLLALTFYFAFLEKIi 293 

+ ++ AL++ + LI +HLSPGG+ADLLA T+ L +L 

Sbjct: 262 AKGGFLYQEIEQALTALDSVLIEKHLSPGGSADLLAATKLIYELVQL 308 


Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2717 

A DNA sequence (GASx1172R) was identified in S.pyogenes <SEQ ID 7933> which encodes the amino 
acid sequence <SEQ ID 7934>. Analysis of this protein sequence reveals the following: 

Possible site: 23 



WO 02/34771 






PCT/GB01/04789 



>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.2501 (Affirmative) < succ> 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ> 

bacterial outside --- Certainty=0.0000 (Not Clear) < succ> 



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB12389 GB:Z99107 similar to transcriptional regulator (GntR 
family) [Bacillus subtilis] 
Identities = 60/205 (29%) , Positives = 99/205 (48%) , Gaps = 3/205 (1%) 





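[Alignment residues were lost in extraction. Surviving segment start positions: Query 19 / Sbjct 11; Sbjct 71; Query 137 / Sbjct 131; Query 197 / Sbjct 190. Surviving match-line fragments: "b FYN LKK I" and "+SR+PIR A+ LL ++ L++".]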



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 



Example 2718 

A DNA sequence (GASx1173R) was identified in S.pyogenes <SEQ ID 7935> which encodes the amino
acid sequence <SEQ ID 7936>. Analysis of this protein sequence reveals the following: 

Possible site: 16 



>>> Seems to have a cleavable N-term signal seq.
INTEGRAL Likelihood = [...] Transmembrane 450 - 466 ( 445 - [...])
INTEGRAL Likelihood = [...] Transmembrane 33 - 49 ( 30 - [...])
INTEGRAL Likelihood = [...] Transmembrane 326 - 342 ( 321 - [...])
INTEGRAL Likelihood = [...] Transmembrane 288 - 304 ( 286 - [...])
INTEGRAL Likelihood = [...] Transmembrane 95 - 111 ( 88 - [...])
INTEGRAL Likelihood = [...] Transmembrane [...] - 281 ( 264 - [...])
INTEGRAL Likelihood = [...] Transmembrane 208 - 224 ( 204 - [...])
INTEGRAL Likelihood = [...] Transmembrane [...] - 142 ( 126 - [...])

Final Results
bacterial membrane --- Certainty=0.5394 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



A related sequence was also identified in GAS <SEQ ID 9169> which encodes the amino acid sequence
<SEQ ID 9170>. Analysis of this protein sequence reveals the following:

Possible cleavage site: 39 

>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood =-10.99 Transmembrane 443 - 459 ( 438 - 466)
[Further INTEGRAL entries were lost in extraction. Surviving fragments: "Transmembrane 31[...]", "[...]34 Transmembrane 412 - [...]", "410 - 431".]

Final Results
bacterial membrane --- Certainty=0.539 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 



Query: 9 liTMLAYAMIIVFMYVVMKKKMTPFTALVMIPLIMTIAVILTGSADFNADAKFVAFVGDG 68
+LT+LA+AM+ FM+++M K+++ AL+++P +AF G 

Sbjct: 1 MLTLLAFAMVATFMFLIMTKRLSALIALILVP IAFALIG 39 

Query: 69 GIAKDLTAIGPMVMYGINNTAKTGIMLLFAILFFSVMLDAGLFDPITEKMIRFAKGDPMK 128 
GAL GPM++ GI A TG+ML+FAIL+F++M+D+GLFDP K++R KGDP+K

Sbjct: 40 GFAAGL---GPMMLDGIRTLAPTGVMLMFAILYFAIMIDSGLFDPAVRKILRLVKGDPLK 96 

Query: 129 VLIATAVVAAAVSLNGDGTTTTLICCSAFLPIYKKLDMKIMNLGVLIILQNTIMNLLPWG 188 
V + TA +A VSL+GDG+TT +IC +A LP+Y +L M + + LI+L + ++N+ PWG 
Sbjct: 97 VSLGTAAIJWIVSLDGDGSTTYMICTAAVLPLYSPJ^GMSPLVMACLIMLSSGVLNMTPWG 156

Query: 189 GPTARAMSVLGVGP-EILGYIiAPGMILSLL- -YVICWVAPSMGRKERARLGVIDL- -SEE 243 

GPTARA S L V P +1 + P MI LL + I W+ G++ERARLG + L E 
Sbjct: 157 GPTARA&SALHVDPADIFVPMIPAMIAGLLAIFAIAWI YGKRERARLGELHLPTDHE 213 


Query: 244 DMRQLTDITDPDTLFIRRPKNFVFNAILTIGLITWLVAGSFNKSIAMAPLLLFAVGTCIA 303 

D+ +++ P+ RRPK FNAILT+ L+ L+AG + M L + A G IA 

Sbjct: 214 DLAEISVSQYPEA RRPKLLWFNAILTWLMATLIAGL LPMPVLFMIAFG- - IA 264 

Query: 304 LMVNYPVLKDQSKRIGDNAGDAVQWILVFAAGIFMGLFQGSGMASALAQSFATIIPKQL 363

++VNYP +++Q KRIG +A + + W L+FAAG-hF G+ G+GM A++4S +IP L 
Sbjct: 265 MIYNYPCIQEQKKRIGAHAENILAVVSLIFAAGVFTGILSGTGMVDAMSKSLLAVIPPAL 324 

Query: 364 AGFWGLVIALVSAPGTFFISNDGFYYGILPVIjAFAGIAEYGFSNMAMALftSLMGQAFHLLS 423 
+ + ALVS P TFF+SND FYYG+LP+L +A AEYG + + MA AS++GQ HLLS

Sbjct: 325 GPYIiATITALVSMPFTFFMSNDAFYYGVLPILTQAAAEYGITPVEMARASIVGQPVHLLS 384 

Query: 424 PLVAF I YLLLRLTGLDMGEWQKEAAKYALI IFVI FWT 1 1 AMGQMPLY 471 
PLV YLL+ L +D G+ Q+ K+A+++ + + + +G PL+ 
Sbjct: 385 PLVPSTYLLVGIAKIDFGDHQRFTLKWAVLVCLAILAMALLLGLFPLF 432

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 



Example 2719 

A DNA sequence (GASx1174) was identified in S.pyogenes <SEQ ID 7937> which encodes the amino acid
sequence <SEQ ID 7938>. Analysis of this protein sequence reveals the following:

Possible site: 57

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.3948 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2720 

A DNA sequence (GASx1175) was identified in S.pyogenes <SEQ ID 7939> which encodes the amino acid
sequence <SEQ ID 7940>. Analysis of this protein sequence reveals the following:

Possible site: 39

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.3519 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
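
As an illustration of how the "Final Results" blocks in these examples can be read, the following minimal Python sketch extracts the certainty reported for each compartment and picks the highest one. The regular expression, the function names `parse_localization` and `predicted_site`, and the assumed line shape are illustrative assumptions and are not part of the original analysis software.

```python
import re

# Assumed line shape: "bacterial <site> --- Certainty=0.3519 (Affirmative) < succ>".
# Optional dashes and stray spaces inside the number are tolerated.
_LOCALIZATION = re.compile(
    r"bacterial\s+(cytoplasm|membrane|outside)\W*Certainty\s*=\s*"
    r"(\d)\s*\.\s*(\d+)\s*\(([^)]*)\)"
)


def parse_localization(text):
    """Map each compartment to (certainty, label) for one 'Final Results' block."""
    results = {}
    for site, whole, frac, label in _LOCALIZATION.findall(text):
        results[site] = (float(f"{whole}.{frac}"), label.strip())
    return results


def predicted_site(results):
    """Return the compartment with the highest reported certainty."""
    return max(results, key=lambda site: results[site][0])


# Values taken from Example 2720; the second line keeps a stray space on purpose.
block = """
bacterial cytoplasm --- Certainty=0.3519 (Affirmative) < succ>
bacterial membrane --- Certainty=0 . 0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
"""
results = parse_localization(block)
print(predicted_site(results), results[predicted_site(results)])
```

The sketch only ranks the three certainty values; it makes no claim about how the values themselves were computed.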

Example 2721

A DNA sequence (GASx1177) was identified in S.pyogenes <SEQ ID 7941> which encodes the amino acid
sequence <SEQ ID 7942>. Analysis of this protein sequence reveals the following:

Possible site: 60

>>> Seems to have an uncleavable N-term signal seq
INTEGRAL Likelihood = [...] Transmembrane 115 - 131 ( 105 - 137)
INTEGRAL Likelihood = [...] Transmembrane 208 - 224 ( 204 - 238)
INTEGRAL Likelihood = [...] Transmembrane 282 - 298 ( 273 - 303)
INTEGRAL Likelihood = [...] Transmembrane 85 - 101 ( 75 - 102)
INTEGRAL Likelihood = [...]04 Transmembrane 10 - 26 ( 3 - 32)
INTEGRAL Likelihood = [...]61 Transmembrane 255 - 271 ( 253 - 271)

Final Results
bacterial membrane --- Certainty=0.4694 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAB89172 GB:AE000960 oxaloacetate decarboxylase, sodium ion pump 
subunit (oadB) [Archaeoglobus fulgidus] 
Identities = 190/354 (53%) , Positives = 255/354 (71%) , Gaps = 8/354 (2%) 

Query: 16 IVMWIGALLMYLGIKKEYEPTLLVTMGLGTILVNFrX3SGvLTQVA^GVEQEGVFEALFN 75 

+VM+ +G LL+YLGI K+ EP LLVP+G+G ILVN PG G+ E+ +F+ 

Sbjct: 5 LVMIGVGLLLVYLGIVKKMEPLLLVPIGIGAILVNIPGGGL AEEGS I FDLFLK 57 

Query: 76 FGIGTELFPLLIFIGIGAMIDFGPLLQNPFMLLFGDAAQFGIFFvvWAVIjAGFDIKEAA 135

+ I TE+ PLLIF+G+GA+ DF PLL NP L G AAQ GIF ++ A+ GF +EAA 
Sbjct: 58 YLIHTEIVPLLIFLGLGALTDFSPLLANPKTFLLGAAAQIGIFAALIAALFLGFTPQEAA 117 

Query: 136 SIGIIGAMDGPTSIFVAWQLAKDLIfiPITWUlYSYMALVPIIQPFAIKLVTTKKERRIRM 195 

SIGIIG ADGPT+I+ LA LL VAAYSYM+LVPIIQP IK +T+ +ER+I+M 
Sbjct: 118 S IGI IGGADGPTTI YTTT I LAPHLLAATAVAAYSYMSLVPI I QPPI I KALTS SRERKI KM 177 

Query: 196 TYKAENVSQMTKILFPIIITLVAGFIAPISLPLVGFLMFGHLLRECGVLDRLSQTAQNEL 255 

+ VS+ KILFPI +++GF+AP 4-LPIiVG LM GNL RE GV DRL++ A EL 
Sbjct: 178 R-QLRIVSKKEKILFPIATIIISGFLAPKALPLVGMLMTGNLFRESGVTDRLAKGASEEL 236 

Query: 256 VNIISILLGLTISIKMQADLFLNVQTLLIIVFGLLAFIMDSIGGVMFAKFLNLFRKEKIN 315 

+NI++I+LGL++ M+A+ FL +TLL++ G++AF + GGV+ AK +NLF KEKIN 
Sbjct: 237 MNIMTIILGLSVGSTMPAESFLTQKTLLVIJ^WAFAAATAGGVLIAKVMNLFLKEKIN 296 

Query: 316 PMIGAAGISAFPMSSRVIQKMATDEDPQKFILMYAVGANVSGQIASVIAGGLLL 369 

PMIGAAG+SA PMS+RV+Q++A +EDP N ILM+A+G NV+G I S +A G+L+ 
Sbjct: 297 PMIGAAGVSAVPMSARWQRLAIEEDPHNHILMHAMGPKVAGVIGSAVAAGVLI 350 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2722 

A DNA sequence (GASx1178) was identified in S.pyogenes <SEQ ID 7943> which encodes the amino acid
sequence <SEQ ID 7944>. Analysis of this protein sequence reveals the following:

Possible site: 16

>>> Seems to have no N-terminal signal sequence
INTEGRAL Likelihood = -9.50 Transmembrane 21 - 37 ( 8 - 43)

Final Results
bacterial membrane --- Certainty=0.4800 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2723 

A DNA sequence (GASx1179) was identified in S.pyogenes <SEQ ID 7945> which encodes the amino acid
sequence <SEQ ID 7946>. Analysis of this protein sequence reveals the following:

Possible site: 60

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.1906 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



>GP:AAF93961 GB:AE004165 citrate lyase, gamma subunit [Vibrio cholerae]
Identities = 46/97 (47%), Positives = 64/97 (65%)

Query: 1 MDIKQTAVAGSLESSDLMITVSPNDEQTITITLDSSVEKQFGNHIRQLIHQTLVNLKVTA 60
MIA AG+LESSDL + 4 PN++ I + LDS+VE+QFG+ IRQ++ TL ++V
Sbjct: 1 MKIAHPAFAGTLESSDLQVRIEPHNDGGIELVLDSTVEQQFGHAIRQVVLHTLDAMQVRD 60

Query: 61 AKVEAVDKGALDCTIQARTIAAVHRAAGIDQYDWKEI 97
A V DKGALDC I+AR AAV RA + +W ++
Sbjct: 61 ALVTIEDKGALDCVIRARVQAAVMRACDVQNIEWSQL 97

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
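
The "Identities", "Positives" and "Gaps" figures quoted with each alignment are counts over the aligned length, with the percentage given in parentheses. A minimal Python sketch, assuming only that field layout, recomputes the ratios from such a line. The function name `parse_alignment_stats` and the regular expression are illustrative assumptions; the printed whole-number percentages in the alignment headers may differ from the recomputed values by rounding.

```python
import re

# Matches fields such as "Identities = 46/97 (47%)". The field names come from
# the alignment headers; the regular expression itself is an assumption.
_FIELD = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)")


def parse_alignment_stats(line):
    """Return {field: (count, aligned_length, percent)} for one statistics line."""
    stats = {}
    for name, count, length in _FIELD.findall(line):
        count, length = int(count), int(length)
        stats[name] = (count, length, 100.0 * count / length)
    return stats


# Statistics line from Example 2723 (citrate lyase, gamma subunit).
example = "Identities = 46/97 (47%) , Positives = 64/97 (65%)"
stats = parse_alignment_stats(example)
for name, (count, length, pct) in stats.items():
    print(f"{name}: {count}/{length} = {pct:.1f}%")
```

A line without a "Gaps" field, as in this example, simply yields no "Gaps" entry.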

Example 2724 

A DNA sequence (GASx1181) was identified in S.pyogenes <SEQ ID 7947> which encodes the amino acid
sequence <SEQ ID 7948>. Analysis of this protein sequence reveals the following:

Possible site: 16

>>> Seems to have no N-terminal signal sequence
INTEGRAL Likelihood = -1.65 Transmembrane 74 - 90 ( 74 - 90)

Final Results
bacterial membrane --- Certainty=0.1659 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAA71632 GB:Y10621 CILB, citryl-CoA lyase beta subunit
[Leuconostoc mesenteroides] 
Identities = 187/293 (63%), Positives = 237/293 (80%), Gaps = 1/293 (0%) 

Query: 2 ERLRRTMMFVPGANAAMLRDAPLFGADSIMFDLEDSVSLKEKDTSRALVHFALKTFDYSS 61
ERLRRTMMFVPG N AM++DA +FGADSIMFDLED+VSL EKD++R LV+ AL+T DY S
Sbjct: [...] ERLRRTMMFVPGNNPAMVKDAGI FGADSIMFDLEDAVSLAEKDSARYLVYEALQTVDYGS 63

Query: 62 VETWRVNGLDS - CGALDIEAWLAGVNVIRLPKTETAQDIIDVEAVIERVERENS IEVG 120
E WR+NGLD+ DI+A+V AG++VIRLPK ETA + ++E++I E+E VG
Sbjct: 64 [...]

Query: 121 [...]
r MMAAIESA GV+NA EIA AS R+IGIAL AEDY T+MKT RYPDGQEL +AR+ + IL
Sbjct: 124 [...]

Query: 181 [...]
HAARAAGIAA DTV++++N+ EGF E ++I QLGFDGKS+INPRQI +VN++Y PT+KE
Sbjct: 184 [...]

Query: 241 [...]
I++A+ VI AI EA+ KGSGVIS+NG+MVD+P+V RA+RV+ LA A ++ E
Sbjct: 244 [...]



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2725

A DNA sequence (GASx1182) was identified in S.pyogenes <SEQ ID 7949> which encodes the amino acid
sequence <SEQ ID 7950>. Analysis of this protein sequence reveals the following:

Possible site: 55

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.3554 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAA71633 GB:Y10621 CILA, citrate CoA-transferase alpha subunit
[Leuconostoc mesenteroides]
Identities = 294/511 (57%), Positives = 378/511 (73%), Gaps = 7/511 (1%)

Query: 4 NKLGRDIPQPYADQY--GVFEGELANIKQYDESSRRIKPVKPGDSKLLGSVREAIEKTGL 61 
NK+ D+P +Q VFE + +++ G+SK+ S+ + + T L

Sbjct: 3 NKVNIDVPDAI LEQLDDS VFESTNYGNPE I QRVGPKVRATT - GESKVQSSIDD VLSNT - L SO 

Query: 62 TDGMTISFHHHFREGDFIMNMVLEEIAKMGIKKLSIAPSSIANV-HEPLIDHIKNGWTN 120 
D3MTISFHHHFREGDF+ N V+ +1 MG +NL++APSS+ KV ++ +1+ IK GWTN 
Sbjct: 61 KDGMTISFHHHFRHGDFVFNKVMRKIIDMGYQNLTIjAPSSLTNVMNDIVIEAIKKGVvTO 120

Query: 121 ITSSGLRDKVGAAISEGLMENPWIRSHGGRARAIASGDIHIDVAFLGAPSSDAYGNVNG 180 

ITSSG+R +G A+S G+++NPV+ RSHG RARAI SG+I IDVAFLG P+SD GN NG 
Sbjct: 121 ITSSGMRGTLGDAVSHGILKNPVIFRSHGARARAIESGEIKIDVAFLGVPNSDEMGNANG 180 


Query: 181 TKGKATCGSLGYAMIDAKYADQWILTDNLVPYPNTPIS I PQTDVDYWTVDAIGDPQGI 240 

G A GSLGYA+IDA+YAD++V++TD ++PYPNTP SI QT VDYW VD +GDP I 
Sbjct: 181 MNGDAAFGSLGYALIDAQYADKLVLITDTIMPYPNTPASIKQTQVDYWKVDKVGDPDKI 240 

Query: 241 AKGATRFTKNPKELLIAEYAAKVITNSPYFKEGFSFQTGTGGASLAVTRFMREAMIKENI 300

GATRFTK+PKEL IA+ VI NS YFK FSFQTG+GGA1-LAVTRF+REAM+ +NI 
Sbjct: 241 GSGATRFTKDPKELKIAKTVNDVIVNSKYFKNDFSFQTGSGGAAIAVTRFLREAMMAQNI 300 

Query: 301 KASFALGGITNAMVELLEEELVEKILDVQDFDHPSAVSLGKHAEHYEIDANMYASPLSKG 360 
ASFALGGIT V+LL E LV +++DVQDFD +A S+ EIDA+ YA P +KG

Sbjct: 301 ^SFALGGITKPTvDLLNEGLVNRWJDVQDFDKGAASSMKLSPNQQEII^SWYADPANKG 360 

Query: 361 AVINQLDTCILSALETOTNFNVNVMTGSDGVIRGASGGHCDTAFAAKMSLVISPLIRGRI 420 
A++++LD ILSALEVDTNFNVNVM+GSDGVIRGA GGH D A AK++++ PL+RGRI 
Sbjct: 361 AMVTDKLDVAILSALEVDTNFNVNVMSGSDGVIRGAIGGHQDAA-TAKLTIISVPLVRGRI 419

Query: 421 PTFVDEVNWITPGTSVDVIVTEVGIAINPNRQDIjVDHFKSL-NVPQFSIEELKEJCAYAI 479 

T V +VNTVITPG S+DV+VTEVGIAINP R DLV+ K + +P +SIEEL+4-KA I 
Sbjct: 420 ATIVPKVNTVITPGDSIDVVVTEVGIAINPKRTDLVEQLKQVPGLPIYSIEELQQKAEKI 479 


Query: 480 VGTPERIQYGDKWALIEYRDGSLMDWYNV 510 

VG P +++ D+WA4 EYRDGS++D++ V 
Sbjct: 480 VGQPAPLKFTDRWAVAEYRDGSVIDIIKEV 510 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2726 

A DNA sequence (GASx1183) was identified in S.pyogenes <SEQ ID 7951> which encodes the amino acid
sequence <SEQ ID 7952>. Analysis of this protein sequence reveals the following:

Possible site: 13

>>> Seems to have a cleavable N-term signal seq.

Final Results
bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAA71634 GB:Y10621 CILG, hypothetical protein [Leuconostoc 
mesenteroides] 

Identities = 65/176 (36%), Positives = 97/176 (54%), Gaps = 3/176 (1%) 

Query: 21 DTYFSGEAIQLSDMLRAREERALRQLHLLKEYPEGSLLSVTMNIPGPIKTSPKLLEAFDI 80 

D + GE + L +L RE R Q L+ +P + SV +N+PGPIKTSPKL F I 
Sbjct: 2 DYFEGGERim,MQvLDNREITOEKyQKQL^SFPTAVITSVKLNLPGPIKTSPKLQSVFQI 61 

Query: 81 VIKAIQTAIADDKICYQLRLL-PTTGYEYYLITSLPSRDLKLKMIALETELPIGRLMDBD 139 

+ 1 + D +1 + + TG + + +TS + +K MI E +GRL+DLD 

Sbjct: 62 IINDLNPVFKDLQIIKEASFVDQITGPDIFFVTSGCLKLVKQIMITFEESHLLGRLLDLD 121 

Query: 140 VLVLQNDLPHSISRTVLGGSPRQCFICSKEAKVCGRLRKHSVEEMQTAISKLLHSF 195 

V+ D +SR LG +PR+C +C K+AK C + HS+ E + I+K+LH+F 
Sbjct: 122 VMCQNAD - - KQLSREELGFAPRKCLLCGKDAKTC I KEGNHSLAEGYSQINKMLHNF 175 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2727 

A DNA sequence (GASx1184) was identified in S.pyogenes <SEQ ID 7953> which encodes the amino acid
sequence <SEQ ID 7954>. Analysis of this protein sequence reveals the following:

Possible site: 58

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.3730 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAB99233 GB:U67563 oxaloacetate decarboxylase alpha chain (oadA) 
[Methanococcus jannaschii] 
Identities = 245/441 (55%) , Positives = 336/441 (75%) , Gaps = 5/441 (1%) 

Query: 10 IRITETVLRDGQQSQIATRMTTKEMIPrLETLDNAGYHALEMWGGATFDSCLRFLNEDPW 69 

++I +T RD QQS IATRM T++M+PI E +D G++++E+WGGATFD+C+R+LNEDPW 
Sbjct: 2 VKIVDTTFRDAQQSLIATRMRTEDMLPIAEKMDEVGFYSMEVWGGATFDACIRYLNEDPW 61 

Query: 70 ERLRAIRKAVKKTKLQMLLRGC^IfiYRNYA 129 

ERLRA++K ++ T LQMLLRGQNL+GYR+Y DD+V F+ K+ ENGIDI RIFDALND R 
Sbjct: 62 ERLRALKKRIQNTPLQMLLRGQNLVGYRHYPDDrVEKWIKAHFJJGIDIFRIFDALNDVR 121 

Query: 130 NLQTAVSATKKFGGHAQVAISYTTSPVHTIDYFVELAICAYQAIGADSICIKDMAGVLTPE 189 

N++TA+ KK G Q AI YT SPVHTID +VELAK + +G DS I C I KDMAG+LTP 
Sbjct: 122 NMETAIKTAKKVGAEVQGAICYTISPVHTIDQYVELAKKLEEM3CDSICIKDMAGLLTPY 181 

Query: 190 IGYQLVKCIKENTTIPLEVHTHATSGISEMTYLKVAEAGADIIDTAISSFSGGTSQPATE 249

GY+LVK +KE ++P++VH+H TSH++ MTYLKV EAGAD++D AIS F+ GTSQP TE 
Sbjct: 182 EGYELVKRLKEEISLPIDVHSHCT8GLAPMTYLKVIE3VGADMVDCAISPFAMGTSQPPTE 241 

Query: 250 SMAIALTDLGFDTGLDMQEVAKVAEYFNTIRDHYRE1GILNPKVKDTEPKTLIYQVPGGM 309 

S+ +AL +DTGLD++ + ++ +YF +R+ Y+ + +P + + + L+YQVPGGM 
Sbjct: 242 S I WALKGTKYDTGLDLKLLNEIRDYEMKVREKYKM — LFS P I SQI VDARVLVYQVPGGM 299 

Query: 310 LSNLLSQLTEQGLTDKYEEVLAEVPKVRaDLGYPPLVTPLSQMVGTQALMNI 1 SGERYKV 369 

LSNL+SQL EQG DK+EEVL E+P+VR DLGYPPLVTP SQ+VGTQA++N+++ ERYK+ 
Sbjct: 300 LSMjVSQLKEQGALDKFEEVLQEIPRVRKDLGYPPLVTPTSQIVGTQAVLNVLTEERYKI 359 

Query: 370 VPNEIKDYVRGLYGQSPAPLAEGIKEKI1GD-EAVITCRPADLIEPQMIYLRDEIAP--Y 426 

+ NE+ +YV+G YG+ PAP+ + ++++ + E ITCRPADL+ P+ ++ E 
Sbjct: 360 ITNEVVWYVKGFYGKPPAPINPELLKRVLDEGEKPITCRPADLLPPEWEKVKKEAEEKGI 419 

Query: 427 AHSEEDVLSYASFPQQARDFL 447 

EED+L+YA +PQ A Fh 
Sbjct: 420 VKKEEDILTYALYPQIAVKFL 440 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2728 

A DNA sequence (GASx1185R) was identified in S.pyogenes <SEQ ID 7955> which encodes the amino
acid sequence <SEQ ID 7956>. Analysis of this protein sequence reveals the following:

Possible site: 40

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2497 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAF93960 GB:AE004165 citrate (pro-3S)-lyase ligase [Vibrio cholerae]
Identities = 118/336 (35%), Positives = 183/336 (54%), Gaps = 5/336 (1%)

Query: 4 YTISKVFPSDKTTMASVKNLLHQEGIRLDAHLDYTCAIMNAQNDVIATGSYFGNSLRCLC 63 

YT S+V ++T + +K L Q + +D +++ + N +IA G G+ L+ + 
Sbjct: 10 YTFSRVSTKNRTKLLQIKEFLCQHQLTVDDDVEHF-VVAYGTNQIIACGGIAGHVLKSIA 68 

Query: 64 VSSAYCGEGLI^IVSHLIDEEY7ALGNYHLFVYTKTSSAAFFKDLGFTEIVHIDNHISFL 123

VS A QG G ++++ L + Y +G + LF++TK ++ F+ GF + ++ HI+ L 
Sbjct: 69 VSPALCGTGFALKLMTELTNFAYEMGRFSLFLFTKPANIDLFRQCGFFLVDKVEPHIALL 128 

Query: 124 ENKKTGFQDYLMTIMKPEQTPGKVA^IVINANPFTLGHQFLVEKAAREIro^fVHLF^W'SED 183 

EN Y h + + K+ +IV+NANPFTLGEQ+L+E+A + DWvHLF+V + 

Sbjct: 129 ENSPNRLSWCKQLQLLKMSGRKIGSIVMNANPFTLGHQYLIEQACEQCDWVHLFWKAE 188 

Query: 184 RSLIPFSVRKRLIQEGLAHLDNVIYHETGPYLISQATFPAYFQI^EDNDVIKSQALLDTAI 243 

++ R +1+ G HLN+ H Y+IS+ATFP+YF K+ V +S LD +1 
Sbjct: 189 NKDFSYADRMAMIKAGSKHLLNLTIESGSDYIISRATFPSYFIKDQQVVNQSHTALDLSI 248 

Query: 244 FL-KIAQTLQITKRYVGEEPTSRVTAIYNEIM AEQLQQAGILLDILPRKAINQQQDP 299 

F IA L IT R+VG EP VT YN+ M E+ A + ++ + Q p 
Sbjct: 249 FRHSIAPALGITHRrA?GSEPICTVTRHYNQAMRRWLEEAHDASAPIQVVEIERSQQASQP 308 

Query: 300 ISASTARQALKDNDroLLAKDLPKTSLDYFCSIiKAQ 335 
ISAS R LK + +A L+PKT+ Y C A+
Sbjct: 309 I SASRWYLLKQFGFAAIADLVPKTTYSYLCQHYAE 344 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2729 

A DNA sequence (GASx1187) was identified in S.pyogenes <SEQ ID 7957> which encodes the amino acid
sequence <SEQ ID 7958>. Analysis of this protein sequence reveals the following:

Possible site: 30

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.4790 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2730 

A DNA sequence (GASx1188R) was identified in S.pyogenes <SEQ ID 7959> which encodes the amino
acid sequence <SEQ ID 7960>. Analysis of this protein sequence reveals the following:

Possible site: 21

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.3956 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2731 

A DNA sequence (GASx1190) was identified in S.pyogenes <SEQ ID 7961> which encodes the amino acid
sequence <SEQ ID 7962>. Analysis of this protein sequence reveals the following:

Possible site: 14

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.1274 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2732 

A DNA sequence (GASx1196R) was identified in S.pyogenes <SEQ ID 7963> which encodes the amino
acid sequence <SEQ ID 7964>. Analysis of this protein sequence reveals the following:

Possible site: 33

>>> Seems to have a cleavable N-term signal seq.

Final Results
bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2733 

A DNA sequence (GASx1211) was identified in S.pyogenes <SEQ ID 7965> which encodes the amino acid
sequence <SEQ ID 7966>. Analysis of this protein sequence reveals the following:

Possible site: 15

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.1850 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2734 

A DNA sequence (GASx1219R) was identified in S.pyogenes <SEQ ID 7967> which encodes the amino
acid sequence <SEQ ID 7968>. Analysis of this protein sequence reveals the following:

Possible site: 15

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2284 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2735 

A DNA sequence (GASx1225) was identified in S.pyogenes <SEQ ID 7969> which encodes the amino acid
sequence <SEQ ID 7970>. Analysis of this protein sequence reveals the following:

Possible site: 27

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2062 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2736 

A DNA sequence (GASx1229) was identified in S.pyogenes <SEQ ID 7971> which encodes the amino acid
sequence <SEQ ID 7972>. Analysis of this protein sequence reveals the following:

Possible site: 15

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.2755 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2737

A DNA sequence (GASx1247R) was identified in S.pyogenes <SEQ ID 7973> which encodes the amino
acid sequence <SEQ ID 7974>. Analysis of this protein sequence reveals the following:

Possible site: 31

>>> Seems to have a cleavable N-term signal seq.
INTEGRAL Likelihood = -6.32 Transmembrane 55 - 71 ( 53 - 81)
INTEGRAL Likelihood = -6.00 Transmembrane 74 - 90 ( 72 - 95)
INTEGRAL Likelihood = -2.18 Transmembrane 95 - 111 ( 95 - 111)
INTEGRAL Likelihood = -1.54 Transmembrane 124 - 140 ( 123 - 141)

Final Results
bacterial membrane --- Certainty=0.3527 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



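Query: 6 RTLKMTLATIVAILIAYQLHLDYAMSAC-I IALLSVLDTRKSSLWARNRLLSFFLAFGIA 65
RT+K L T +AI 1+ LHL SAGII +L + T+K SL + R + LA +
Sbjct: [...] RTIKTALGTALAIYISQLLHLQNFASAGIITILCIQITQKRSLQASWARFWACCLAIAFS 66

[The remaining alignment segments were lost in extraction. Surviving segment start positions: Sbjct 67; Query 126 / Sbjct 127; Query 186 / Sbjct 187; Query 246 / Sbjct 247; Query 306 / Sbjct 307. Surviving match-line fragments: "L VG VALL N YM D+++ Y +K+E -i" and "LP TREEFE RA LF LIi ++E+++".]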



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2738 

A DNA sequence (GASx1261) was identified in S.pyogenes <SEQ ID 7975> which encodes the amino acid
sequence <SEQ ID 7976>. Analysis of this protein sequence reveals the following:

Possible site: 15

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.6082 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2739

A DNA sequence (GASx1262R) was identified in S.pyogenes <SEQ ID 7977> which encodes the amino
acid sequence <SEQ ID 7978>. Analysis of this protein sequence reveals the following:

Possible site: 51

>>> Seems to have no N-terminal signal sequence
INTEGRAL Likelihood = -7.06 Transmembrane 38 - 54 ( 37 - 55)

Final Results
bacterial membrane --- Certainty=0.3824 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2740 

A DNA sequence (GASx1265R) was identified in S.pyogenes <SEQ ID 7979> which encodes the amino
acid sequence <SEQ ID 7980>. Analysis of this protein sequence reveals the following:

Possible site: 25

>>> Seems to have a cleavable N-term signal seq.

Final Results
bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2741 

A DNA sequence (GASx1270) was identified in S.pyogenes <SEQ ID 7981> which encodes the amino acid
sequence <SEQ ID 7982>. Analysis of this protein sequence reveals the following:

Possible site: 36

>>> Seems to have no N-terminal signal sequence

Final Results
bacterial cytoplasm --- Certainty=0.4063 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2742

A DNA sequence (GASx1290R) was identified in S.pyogenes <SEQ ID 7983> which encodes the amino
acid sequence <SEQ ID 7984>. Analysis of this protein sequence reveals the following: 

Possible site: 26 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood =-12.37 Transmembrane 180 - 196 ( 172 - 207) 
INTEGRAL Likelihood =-10.19 Transmembrane 34 - 50 ( 30 - 53) 
INTEGRAL Likelihood = -4.09 Transmembrane 233 - 249 ( 232 - 250) 

Final Results

bacterial membrane --- Certainty=0.5946 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAB88010 GB:L21856 MalA [Streptococcus pneumoniae]
Identities = 66/237 (27%), Positives = 105/237 (43%), Gaps = 28/237 (11%) 

Query: 45 MIPVTLHYANMTTYPLERIVTKSLSPITDKTYQALTQGKIEKD TFQGQSLIRRD 98 

M+P+ + ++ TYPLE + P+TDK Q L++ D T+ G + 

Sbjct: 1 MTOIAIQNSSQETYPLETFIDIJVYEPLTDKWQDLSEHATIVDGTLTYTGTASQAPSWI 60 

Query: 99 GELVLAVLPTICVDLEQLASESTRQIIVTKKEWRPVTPDGIQ3L-RAHVRGQQQSLADLTTV 157 

G + LP + h T +++++K + KEL R R Q T 
Sbjct: 61 GPSQIKELPKDLQLHF DTNELVISK - -ESKELTRISYRAIQ TEG 102 

Query: 158 KAVKDFVNQQWY DSNKASVLGFLLLTFVLMVCVGTLIVIGLGAFFLTLTKRSRLFMI 214 

KD + Q + +N+ + FL+L + + IV L +TK+SRLF 

Sbjct: 103 FKSKDSLTQAFIRLVPTNRVYISLFLVLGASFLFGLNFFIVSLGACLLLYITKKSRLFSF 162 

Query: 215 RNFSEGLGLMTOCLAWPSLLAIALSFFIQDPVLIMNCQVFGTLLMLTWVFYKTQFRD 271 

R F E ++NCL P+L+ + L F Q+ ++ Q +L L +FYKT FRD 
Sbjct: 163 RTFKECYHFILNCLGLPTLITLILGLFGONMTTLITVQNILFVLYLVTIFYKTHFRD 219 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
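
The homology summary line of each alignment reports raw counts (matches over aligned length), from which the fractions can be recomputed. The sketch below is illustrative only; `alignment_stats` is a hypothetical helper, and the input is the summary line of the MalA alignment in Example 2742. The recomputed fraction need not equal the integer percentage printed in the original output.

```python
import re

# Matches "Identities = 66/237", "Positives = 105/237", "Gaps = 28/237"
STAT = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)")


def alignment_stats(summary_line):
    """Return {name: (count, aligned_length, percent)} from a summary line."""
    stats = {}
    for name, count, length in STAT.findall(summary_line):
        count, length = int(count), int(length)
        stats[name] = (count, length, 100.0 * count / length)
    return stats


line = "Identities = 66/237 (27%), Positives = 105/237 (43%), Gaps = 28/237 (11%)"
stats = alignment_stats(line)
for name, (count, length, percent) in stats.items():
    print(f"{name}: {count}/{length} = {percent:.1f}%")
```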

Example 2743 

A DNA sequence (GASx1294) was identified in S.pyogenes <SEQ ID 7985> which encodes the amino acid
sequence <SEQ ID 7986>. Analysis of this protein sequence reveals the following: 

Possible site: 18 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.2104 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2744

A DNA sequence (GASx1303R) was identified in S.pyogenes <SEQ ID 7987> which encodes the amino
acid sequence <SEQ ID 7988>. Analysis of this protein sequence reveals the following: 

Possible site: 38 

>>> Seems to have an uncleavable N-term signal seq

) - 38) 



Final Results 

bacterial membrane --- Certainty=0.4227 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2745 

A DNA sequence (GASx1307R) was identified in S.pyogenes <SEQ ID 7989> which encodes the amino
acid sequence <SEQ ID 7990>. Analysis of this protein sequence reveals the following: 

Possible site: 19 

>>> Seems to have a cleavable N-term signal seq.

Final Results 

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2746 

A DNA sequence (GASx1312R) was identified in S.pyogenes <SEQ ID 7991> which encodes the amino
acid sequence <SEQ ID 7992>. Analysis of this protein sequence reveals the following: 

Possible site: 21 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.1996 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2747 

A DNA sequence (GASx1316R) was identified in S.pyogenes <SEQ ID 7993> which encodes the amino
acid sequence <SEQ ID 7994>. Analysis of this protein sequence reveals the following: 

Possible site: 58 

>>> Seems to have no N-terminal signal sequence



Final Results 

bacterial cytoplasm --- Certainty=0.3504 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

RGD motif: 271-273 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC66321 GB:AE000792 outer surface protein, putative [Borrelia 
burgdorferi] 

Identities = 127/365 (34%), Positives = 195/365 (52%), Gaps = 14/365 (3%)

Query: 1 MVDLGFSLYPERYDVTKSKAYIDLCHSYGAKRLFMSLLQLAPADHQMFHCYAELIAYANQ 60 

M ++G S+YP K Y++ +G ++F Shh + + F + EL++ AN+ 
Sbjct: 1 MKEIGI S I YPNVSPKNKI I KYLEKSAHFGFTQVFTSLLYI NGNEFDIFKELLSIANK 57 

Query: 61 LGIRVIADVSPSFISQAGWSDQLIERA- HAFGLAGLRLDEALPLAEIVTLTRNPF 114 

G++ I DVSP + G + G +RLD E +T N 

Sbjct: 58 NGMKPIIDVSPEIFKELGIDLSNLRNCPKLDYFKKLGAWAIRLDNTFTGIEESLMTFNDS 117 

Query: 115 GLKIELNMSTDKQLLMSLIATDAERSNIIGCHNFYPHEFTGLSWQHFKDMSRFYHEHDIE 174 

LKI+LN+S + + +++ N++GCHNFYPH++TGLS FK+ ++ + +1 

Sbjct: 118 DLKI QLNI SNINKHI DT IMYFKPNI KNLLGCHNFYPHKYTGLSRNFFKETTKI FKHYS I P 177 

Query: 175 TAAFITAQSASE-GPWLLAEGLPTVEDHRHLPIGLQVBLMKAIGTIDNILISNQFISEEE 233 

TAAFI++ +A E EG+PT+E HR I Q + + G ID +LISN F SE E 

Sbjct: 178 TAAFISSNNAEECARGKEKEGVPTLESHRSKDIETQAKDLFKEG-IDTVLISNCFPSETE 236 

Query: 234 LAACTQAIiARPVTTIKVRPIIDLTEVEEQII-GYPHCYRGDVSDYVIRSTMPRLVYAQES 292 

L ++ + R + +K D VE++II H RGD++ Y IRSTMPR+ Y + 

Sbjct: 237 LKIWSK-VNRNILELKADmPDANSVEKEIILENLHFNRGDINSYRIRSTMPRVYYNNKK 295 

Query: 293 IAPRDQSKEVKRGSIIIDNDRYHRYKGELQIALKNFTVSSKANWAEVREDYLSLLDDLR 352 

P E+K+G I + ID+ Y Y GELQIALK+ + HW ++ D + LL+ + 

Sbjct: 296 F-PVHSPNEIKKGDILIDSSEYLGYTGELQIALKDTPNNGLVNWGKIINDEIYLLEKIE 354 

Query: 353 PWQEF 357 

Sbjct: 355 PWEKF 359 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2748 

A DNA sequence (GASx1319) was identified in S.pyogenes <SEQ ID 7995> which encodes the amino acid
sequence <SEQ ID 7996>. Analysis of this protein sequence reveals the following: 

Possible site: 34

INTEGRAL Likelihood = -7.… Transmembrane … - 33 ( 15 - …)
INTEGRAL Likelihood = -5.… Transmembrane … - 55 ( 36 - …)
INTEGRAL Likelihood = -1.… Transmembrane … - 76 ( 59 - …)
INTEGRAL Likelihood = -0.59 Transmembrane 85 - 101 ( 85 - 101)



Final Results 

bacterial membrane --- Certainty=0.4800 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2749 

A DNA sequence (GASx1320) was identified in S.pyogenes <SEQ ID 7997> which encodes the amino acid
sequence <SEQ ID 7998>. Analysis of this protein sequence reveals the following:

Possible site: 45 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -1.81 Transmembrane 35 - 51 ( 35 - 51) 

Final Results 

bacterial membrane --- Certainty=0.1723 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2750 

A DNA sequence (GASx1321) was identified in S.pyogenes <SEQ ID 7999> which encodes the amino acid
sequence <SEQ ID 8000>. Analysis of this protein sequence reveals the following: 

Possible site: 29 

>>> Seems to have an uncleavable N-term signal seq

Final Results 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2751

A DNA sequence (GASx1329) was identified in S.pyogenes <SEQ ID 8001> which encodes the amino acid
sequence <SEQ ID 8002>. Analysis of this protein sequence reveals the following:

Possible site: 44 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -1.28 Transmembrane 64 - 80 ( 64 - 80) 

Final Results 

bacterial membrane --- Certainty=0.1510 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2752 

A DNA sequence (GASx1332R) was identified in S.pyogenes <SEQ ID 8003> which encodes the amino
acid sequence <SEQ ID 8004>. Analysis of this protein sequence reveals the following:

Possible site: 37 

>>> Seems to have an uncleavable N-term signal seq 

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2753 

A DNA sequence (GASx1333) was identified in S.pyogenes <SEQ ID 8005> which encodes the amino acid
sequence <SEQ ID 8006>. Analysis of this protein sequence reveals the following: 

Possible site: 29 

>>> Seems to have an uncleavable N-term signal seq

Final Results 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2754 

A DNA sequence (GASx1335R) was identified in S.pyogenes <SEQ ID 8007> which encodes the amino
acid sequence <SEQ ID 8008>. Analysis of this protein sequence reveals the following: 

Possible site: 37 

>>> Seems to have an uncleavable N-term signal seq



Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S. agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

; phosphorylase [Vibrio cholerae] 
…, Positives = 72/167 (42%), Gaps = 12/167 (7%)



Query: 8 GVKEMISTGTCGVLVP-I/ 

G K ++ G+ G + I ++ A+RDEG S Y+ + +++ +++ h 

Sbjct: 79 GAKAIVRVGSAGAMQSEIGLGELILVEGAVRDEGGSJCAYIGAAYPAYSSFELWEMQRFL 138 

Query: 67 IAC^LAYQEVrTWSTDGFYR-ETKEKVAHRQEEGCSVVEKECSALAAVAQLRG IL 120 

Q + I S D FY E E + +G +ME SAL V +LRG +L 

Sbjct: 139 AEQSVPIHRGITOSHDSFYTDEEAELCRYWHRKGILAADMETSALLTVGRLRGLQVASVL 198 

Query: 121 WGQLLFTADTIiADVEVY DQRNWGADSFSFALHLCLEVLNTLEKD 164 

+L+ D A V Y DQR + + A L LN L+ D 

Sbjct: 199 NNWLYEQDVQAGWQYVNADQRMMQGE- -TLAARA7ALHALNALKFD 243 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2755 

A DNA sequence (GASx1353) was identified in S.pyogenes <SEQ ID 8009> which encodes the amino acid
sequence <SEQ ID 8010>. Analysis of this protein sequence reveals the following: 

Possible site: 42 

>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood = -5.79 Transmembrane 241 - 257 ( 234 - 2S0) 
INTEGRAL Likelihood = -5.15 Transmembrane 44 - 60 ( 43 - 65) 
INTEGRAL Likelihood = -4.78 Transmembrane 74 - 90 ( 72 - 92) 

Final Results 

bacterial membrane --- Certainty=0.3314 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2756

A DNA sequence (GASx1354R) was identified in S.pyogenes <SEQ ID 8011> which encodes the amino
acid sequence <SEQ ID 8012>. Analysis of this protein sequence reveals the following: 

Possible site: 55 

>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood = -3.45 Transmembrane 68 - 84 ( 65 - 86) 

Final Results 

bacterial membrane --- Certainty=0.2381 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:CAB83831 GB:AL162753 putative integral membrane protein 
[Neisseria meningitidis] 
Identities = 31/72 (43%), Positives = 46/72 (63%), Gaps = 6/72 (8%)

Query: 17 EVIYAFDKRKAIKKKRRISERKLLVITVLFGGF-GALLAAKKYHHKTRKWyFVI TC 71

F +Y DKR+A++- KRRI E +LL + LFGG+ GA L ++ + HKT K FV+ T 
Sbjct: 38 FALYGIDKRRAVRGKRRIPEHRLL-LPALFGGWAGAYLGSRIFRHKTAKKRFWLFRLTV 96 

Query: 72 YTSILLTLLVTY 83 
++L TL++ Y

Sbjct: 97 SGNVLATLILIY 108 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2757

A DNA sequence (GASx1363R) was identified in S.pyogenes <SEQ ID 8013> which encodes the amino
acid sequence <SEQ ID 8014>. Analysis of this protein sequence reveals the following: 

Possible site: 21 

>>> Seems to have an uncleavable N-term signal seq

Final Results 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2758 

A DNA sequence (GASx1367) was identified in S.pyogenes <SEQ ID 8015> which encodes the amino acid
sequence <SEQ ID 8016>. Analysis of this protein sequence reveals the following:

Possible site: 31 

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MPRKTFDICAFKLSAVKLILEEEQPVKHVSSTLEIHPNSLYQWIQEYEKYGESAFPGHGSA 60 

M R+ FDK FK SAVKLILEE VK VS LE+H NSLY+W+QE E+YGESAFPG+G+A 
Sbjct: 1 MARRKFDKQFKNSAVKLILEEGYSVKEVSQELEVHANSLYRWVQEVEEYGESAFPGNGTA 60 

Query: 61 LRHAQFKTKKLEKEHKLLQEELALLKKFQVFLKPNR 96 

L 4AQ K K LEKE++ LQEEL LLKKF+VFLK ++ 
Sbjct: 61 LANAQHKIKLLEKENRYLQEELELLKKFRVFLKRSK 96 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2759 

A DNA sequence (GASx1374R) was identified in S.pyogenes <SEQ ID 8017> which encodes the amino
acid sequence <SEQ ID 8018>. Analysis of this protein sequence reveals the following:

Possible site: 39 

>>> Seems to have no N-terminal signal sequence



Final Results 

bacterial cytoplasm --- Certainty=0.2585 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2760 

A DNA sequence (GASx1382R) was identified in S.pyogenes <SEQ ID 8019> which encodes the amino
acid sequence <SEQ ID 8020>. Analysis of this protein sequence reveals the following: 

Possible site: 14 

n signal seq 

3 - 19 ( 3-19) 



Final Results 

bacterial membrane --- Certainty=0.1956 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2761 

A DNA sequence (GASx1391R) was identified in S.pyogenes <SEQ ID 8021> which encodes the amino
acid sequence <SEQ ID 8022>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> May be a lipoprotein

Final Results 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2762

A DNA sequence (GASx1404) was identified in S.pyogenes <SEQ ID 8023> which encodes the amino acid
sequence <SEQ ID 8024>. Analysis of this protein sequence reveals the following: 

Possible site: 32 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.3046 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2763 

A DNA sequence (GASx1412R) was identified in S.pyogenes <SEQ ID 8025> which encodes the amino
acid sequence <SEQ ID 8026>. Analysis of this protein sequence reveals the following: 

Possible site: 20 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1590 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2764

A DNA sequence (GASx1414R) was identified in S.pyogenes <SEQ ID 8027> which encodes the amino
acid sequence <SEQ ID 8028>. Analysis of this protein sequence reveals the following: 

Possible site: 24 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.2816 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2765 

A DNA sequence (GASx1416) was identified in S.pyogenes <SEQ ID 8029> which encodes the amino acid
sequence <SEQ ID 8030>. Analysis of this protein sequence reveals the following: 

Possible site: 34 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.1744 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2766 

A DNA sequence (GASx1417) was identified in S.pyogenes <SEQ ID 8031> which encodes the amino acid
sequence <SEQ ID 8032>. Analysis of this protein sequence reveals the following: 

Possible site: 13

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.3771 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2767 

A DNA sequence (GASx1419R) was identified in S.pyogenes <SEQ ID 8033> which encodes the amino
acid sequence <SEQ ID 8034>. Analysis of this protein sequence reveals the following:

Possible site: 13

>>> Seems to have an uncleavable N-term signal seq

20 ( 1 - 25) 

Final Results 

bacterial membrane --- Certainty=0.5373 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2768 

A DNA sequence (GASx1423) was identified in S.pyogenes <SEQ ID 8035> which encodes the amino acid
sequence <SEQ ID 8036>. Analysis of this protein sequence reveals the following: 

Possible site: 39 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -8.97 Transmembrane 30 - 46 ( 25 - 49) 
INTEGRAL Likelihood = -7.80 Transmembrane 52 - 68 ( 50 - 72) 
INTEGRAL Likelihood = -6.95 Transmembrane 129 - 145 ( 125 - 146) 

Final Results 

bacterial membrane --- Certainty=0.4588 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
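
Each "INTEGRAL" line gives a likelihood score, a predicted transmembrane core (start and end residue) and a wider flanking range in parentheses. As a rough illustration only, the three lines reported for Example 2768 can be read into records as below; `Segment` and `parse_segments` are hypothetical names, and nothing here reproduces the prediction method itself.

```python
import re
from typing import NamedTuple


class Segment(NamedTuple):
    likelihood: float  # score printed after "Likelihood ="
    start: int         # first residue of the predicted core
    end: int           # last residue of the predicted core
    outer_start: int   # first residue of the parenthesised range
    outer_end: int     # last residue of the parenthesised range


SEGMENT_LINE = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?[\d.]+)\s+Transmembrane\s+"
    r"(\d+)\s*-\s*(\d+)\s*\(\s*(\d+)\s*-\s*(\d+)\s*\)"
)


def parse_segments(text):
    """Return one Segment per INTEGRAL line, in order of appearance."""
    return [
        Segment(float(score), int(a), int(b), int(c), int(d))
        for score, a, b, c, d in SEGMENT_LINE.findall(text)
    ]


report = """
INTEGRAL Likelihood = -8.97 Transmembrane 30 - 46 ( 25 - 49)
INTEGRAL Likelihood = -7.80 Transmembrane 52 - 68 ( 50 - 72)
INTEGRAL Likelihood = -6.95 Transmembrane 129 - 145 ( 125 - 146)
"""

segments = parse_segments(report)
for seg in segments:
    # Core length in residues, counting both end points
    print(seg.start, seg.end, seg.end - seg.start + 1, seg.likelihood)
```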

Example 2769 

A DNA sequence (GASx1426R) was identified in S.pyogenes <SEQ ID 8037> which encodes the amino
acid sequence <SEQ ID 8038>. Analysis of this protein sequence reveals the following:

Possible site: 25

Final Results

bacterial membrane --- Certainty=0.2381 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC39287 GB:AF115103 orf87 gp [Streptococcus thermophilus 
bacteriophage Sfi21]
Identities = 43/73 (58%), Positives = 61/73 (82%)

Query: 1 MINLKLRLQNKATTLmiLGAIFLIAQQLGlKLPSNIADIANTAvTLLVLLGVVTDPTTKG 60 

MIN KLRLQNK TL+A++ A+FL+ QQ G+ +P+NI + NT V +LV+LG++TDPTTKG 
Sbjct: 8 MINFKLRLQNKATLVALISAVFLMLQQFGLHVPNNIQEGINTLVGILVILGIITDPTTKG 67 



Query: 61 L 

++DSE+AL+Y +P 
Sbjct: 68 IADSERALSYIQP 80 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2770 

A DNA sequence (GASx1427R) was identified in S.pyogenes <SEQ ID 8039> which encodes the amino
acid sequence <SEQ ID 8040>. Analysis of this protein sequence reveals the following:

Possible site: 27



Final Results 

bacterial membrane --- Certainty=0.2211 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2771 

A DNA sequence (GASx1428R) was identified in S.pyogenes <SEQ ID 8041> which encodes the amino
acid sequence <SEQ ID 8042>. Analysis of this protein sequence reveals the following:

Possible site: 20 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1017 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2772 

A DNA sequence (GASx1429R) was identified in S.pyogenes <SEQ ID 8043> which encodes the amino
acid sequence <SEQ ID 8044>. Analysis of this protein sequence reveals the following:

Possible site: 46

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.3097 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 



Example 2773 

A DNA sequence (GASx1431R) was identified in S.pyogenes <SEQ ID 8045> which encodes the amino
acid sequence <SEQ ID 8046>. Analysis of this protein sequence reveals the following: 

Possible site: 50

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.2584 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAA98101 GB:M19348 hyaluronidase [Streptococcus pyogenes phage 
H4489A] 

Identities = 337/371 (90%), Positives = 351/371 (93%), Gaps = 1/371 (0%) 

Query: 1 MAENIPLRVQFKRNKAAEWASSDWLLEGEIGFETDTGFAKFGDGQNTFSKLKYLTGPKG 60 

M ENIPLRVQFKRM A EWA SDV+LLEGEIGFETDTGFAKFGDGQNTFSKLKYLTGPKG 
Sbjct: 1 MTENIPLRVQFKRMSADEWARSDVILLEGEIGFETDTGFAKFGDGQNTFSKLKyLTGPKG 60

Query: 61 PKGDTGLQGKrGGTGSRGPAGKPGTTDYDQLQNKPDLGAFAQKEETNSKITKLESSKADK 120 

PKGDTGLQGKTGGTG RGPAGKPGTTDYDQLQNKPDLGAFAQKEETNSKITKLESSKADK 
Sbjct: 61 PKGDTGLQGKTGGTGPRGPAGKPGTTBYDQLQKKPDLGAFAQKEETNSKITKLESSKADK 120 

Query: 121 NAVYLI<AESNAKLDEKLNLKGGVMTGQLQFKPN-SG1KPSSSVGGAINIDMSKSEGAAMV 179 

+AVY KAES +LD+KL+L GG++TGQLQFKPN SGIKPSSSVGGAINIDMSKSEGAAMV 
Sbjct: 121 SAVYSKAESKIELDKKLSLTGGIVTGQLQFKPNKSGIKPSSSVGGAINIDMSKSEGAAMV 180

Query: 180 MYTNKDTTDGPLMILRSNKDTFDQSVQF\TOKGTTNAVNIVMRQPTTPNFSSaLNITSAN 239

MYTNKDTTDGPLMILRS+KDTFDQS QFVDY G TNA.VNIVMRQP+ PNFSSALNITSAN 
Sbjct: 181 MYTNKDTTDGPLMILRSDKDTFDQSAQFVDYSGKTNAVNIVMRQPSAPNFSSAENITSMI 240 

Query: 240 EGGSAMQIRGVEKALGTLKITHENPSVDKEYDENAAALSIDIVKKQKGGKGTAAQGIYIN 299 

EGGSAMQIRGVEKALGTLKITHENP+V+ +YDENAAALSIDIVKKQKGGKGTAAQGIYIN 
Sbjct: 241 EGGSAMQIRGVEKALGTLKITHENPNVEAKYDENAAALSIDIVKKQKGGKGTAAQGIYIN 300 

Query: 300 STSGTAGKMLRIRNKNKDKFYVGPDGDFWSCASSIVDGNLTVKDPTSGKHAATKDYVDEK 359 

STSGTAGKMLRIRNKN+DKFYVGPDG F S A+S V GNLTVKDPTSGKHAATKDYVDEK 
Sbjct: 301 STSGTAGKMLRIRNKISEDKFWGPDGGFHSGANSTVAGNL.TVKDPTSGKHAATKDYVDEK 360 

Query: 360 IAELKKLILKK 370 

IAELKKLILKK 
Sbjct: 361 IAELKKLILKK 371 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2774 

A DNA sequence (GASx1438R) was identified in S.pyogenes <SEQ ID 8047> which encodes the amino
acid sequence <SEQ ID 8048>. Analysis of this protein sequence reveals the following: 

Possible site: 55 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.1892 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

A related DNA sequence <SEQ ID 10439> was identified in GBS which encodes amino acid sequence 
<SEQID 10440>. 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 LIEVIIKKYLDEHLDVPSFFEHQKDEPARFIILEKTSGAKQNHLLSSTFAFQSYAESLYE 60 

+IE+IIK +LD HL V SF E + + P +I+ EKT +K NHLLSSTFAFQSYA S+YE
Sbjct: 1 MIEIIIKNFLDTHLSVSSFLEKKGEMPLSYILFEKTGSSKSNHLLSSTFAFQSYAPSMYE 60

Query: 61 AALLNDKVKQVIEQLDVLPQVSGVHLNADYNFTDTATKRYRYQAVFDINHY 111

AA LN+++K+V+E+L L ++S V LN+DYNFTDT TK YRYQAVFDINHY
Sbjct: 61 AAKLNEQLKEVVERLIELNEISIWSI^ T SDYNFTDTETKEYRYQAVFDINHY 111

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 
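
The localization report shown in these Examples follows a fixed line pattern (site name, certainty value, call). As a minimal sketch, and not part of the original analysis, the "Final Results" lines could be read into a small table and the highest-certainty site picked out. The function names `parse_localization` and `best_site` are hypothetical, and the pattern is deliberately tolerant of the stray spaces that scanning introduces inside numbers.

```python
import re

# Matches lines such as:
#   bacterial cytoplasm --- Certainty=0.1892 (Affirmative) < succ>
# Whitespace inside the number (e.g. "0. 1892") is tolerated and removed.
RESULT_RE = re.compile(
    r"bacterial\s+(?P<site>cytoplasm|membrane|outside)"
    r".*?[Cc]ertainty\s*=\s*(?P<value>[\d.\s]+?)\s*\((?P<call>[^)]*)\)"
)


def parse_localization(lines):
    """Return {site: (certainty, call)} for every result line found."""
    results = {}
    for line in lines:
        match = RESULT_RE.search(line)
        if match:
            value = float(re.sub(r"\s+", "", match.group("value")))
            results[match.group("site")] = (value, match.group("call"))
    return results


def best_site(results):
    """Return the site with the highest certainty value."""
    return max(results, key=lambda site: results[site][0])


if __name__ == "__main__":
    report = [
        "bacterial cytoplasm --- Certainty=0.1892 (Affirmative) < succ>",
        "bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>",
        "bacterial outside --- Certainty=0.0000 (Not Clear) < succ>",
    ]
    parsed = parse_localization(report)
    print(best_site(parsed), parsed)
```

Keying the table by site name keeps the three predictions comparable across Examples regardless of the order in which the report lists them.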

Example 2775 

A DNA sequence (GASx1442R) was identified in S.pyogenes <SEQ ID 8049> which encodes the amino
acid sequence <SEQ ID 8050>. Analysis of this protein sequence reveals the following:

Possible site: 55

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1241 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2776

A DNA sequence (GASx1444R) was identified in S.pyogenes <SEQ ID 8051> which encodes the amino
acid sequence <SEQ ID 8052>. Analysis of this protein sequence reveals the following:

Possible site: 42

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4547 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2777 

A DNA sequence (GASx1447R) was identified in S.pyogenes <SEQ ID 8053> which encodes the amino
acid sequence <SEQ ID 8054>. Analysis of this protein sequence reveals the following:

Possible site: 25

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2778 

A DNA sequence (GASx1448R) was identified in S.pyogenes <SEQ ID 8055> which encodes the amino
acid sequence <SEQ ID 8056>. Analysis of this protein sequence reveals the following:

Possible site: 20

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3221 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2779 

A DNA sequence (GASx1449R) was identified in S.pyogenes <SEQ ID 8057> which encodes the amino
acid sequence <SEQ ID 8058>. Analysis of this protein sequence reveals the following:

Possible site: 19

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.6356 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2780 

A DNA sequence (GASx1453R) was identified in S.pyogenes <SEQ ID 8059> which encodes the amino
acid sequence <SEQ ID 8060>. Analysis of this protein sequence reveals the following:

Possible site: 13

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2869 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2781 

A DNA sequence (GASx1455R) was identified in S.pyogenes <SEQ ID 8061> which encodes the amino
acid sequence <SEQ ID 8062>. Analysis of this protein sequence reveals the following:

Possible site: 40

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1787 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAF43512 GB:AF145054 ORF19 [Streptococcus thermophilus
bacteriophage 7201] 
Identities = 47/126 (37%) , Positives = 86/126 (67%) , Gaps = 2/126 (1%) 

Query: 8 LKDLRNLDLYIAS!jIRRRDKIEASLL--SSPI<>!SSDKVNGGIKRKQDDVYVELIATAKDI 65 

++ ++ LD YI S I + ++E+ L +S +D V GG ++ +DD+YVELI +++
Sbjct: 7 IQQIKALDRYIESQIEQIICRLESQALICVTSGSMHTDIWQGGKRKGKDDIYVELITAREEV 66 

Query: 66 EKKTAEAIRKQRELQNLIDSLENTDSQTILSMVYIDKMTRWQVIDELNCSESTYFRLLRV 125 

E+ TAEAI+++ E + I ++E+ D++++L MVYID+++ WQ+ D++ S++TY+ LR 
Sbjct: 67 ERFTAFAIKQKLEFRRQIANlEDIDARSLLQm^IDQLSIWQICDI<MGISKATYYVKLRQ 126 

Query: 126 ATKELN 131 

A K L+ 
Sbjct: 127 AEKYLD 132 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
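
Each homology hit in these Examples is summarised by a line of the form "Identities = 47/126 (37%), Positives = 86/126 (67%), Gaps = 2/126 (1%)". As a rough sketch only (the names `parse_summary` and `fraction` are hypothetical and not taken from the search software), such a line could be split into its counts so that hits can be compared numerically. The percentage is stored as reported rather than recomputed, since the rounding convention used by the search program is not stated here.

```python
import re

# One field looks like "Identities = 47/126 (37%)"; optional spaces are
# allowed everywhere because scanned text often inserts them.
FIELD_RE = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)\s*\(\s*(\d+)\s*%\s*\)"
)


def parse_summary(line):
    """Return {field: (count, alignment_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in FIELD_RE.findall(line)
    }


def fraction(stats, field):
    """Return count / alignment_length for one parsed field."""
    count, length, _reported = stats[field]
    return count / length


if __name__ == "__main__":
    line = "Identities = 47/126 (37%) , Positives = 86/126 (67%) , Gaps = 2/126 (1%)"
    stats = parse_summary(line)
    print(stats)
    print(round(fraction(stats, "Identities"), 3))
```

Lines without a Gaps field, such as ungapped hits, simply yield a dictionary with two entries.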

Example 2782 

A DNA sequence (GASx1456R) was identified in S.pyogenes <SEQ ID 8063> which encodes the amino
acid sequence <SEQ ID 8064>. Analysis of this protein sequence reveals the following:

Possible site: 34

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2883 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAB18697 GB:D38906 ORF22 [Bacteriophage r1t]
Identities = 78/207 (37%) , Positives = 123/207 (58%) , Gaps = 2/207 (0%) 

Query: 6 EIHRILGIDEVYKAPKRLTDILFDKDSREDIFRQFLJCYETDVSYDWFMQYFEEEQADRKN 65 

+ + +L +DE R+ +++FDK RE+ + + L D+ D+F YF A 

Sbjct: 7 QFYDMLNVDEHKMFTNRIQELVFDKKGREEFYSKILNIHHDMGVDFFRDYFMAHSAVSA- 65 

Query: 66 KKQDFTPKSVSTLLSKIISGNQYYEVA-VGTGGILIQAWQEQRLNDSPFTYRPSKYWYHV 124 

K Q +TP + L + ++ G+ ++ GTG ++IQ WQ+ R+N F Y PS YWY 
Sbjct: 66 KGQHYTPDELGKLTALLVGGSGGADLTGAGTGTLIIQICWQDDRMNTDFFNYLPSNYWYQA 125 

Query: 125 EELSDKAVPFLLFNMSIRGINGVVVHGDSLTRQVKNIYFLQNTKDDMLSFSDINVMPRTQ 184 

ELSD+A+ FL+ +IRG+NGW+HGD+L VK +YF+QN+ ++ + FS+INV+P ++ 
Sbjct: 126 LELSDEAI SFLIHAFAIRGMNGWIHGDM«EMAVKQVYF IQNS AKNP IGFSEINVI PHSK 185 

Query: 185 DIEREFNVKEWIGDGIEHIENPLIEWI 211

D + EW IEHIE+ +WI 

Sbjct: 186 DAMEFLGIHEWTEQAIEHIESKFPDWI 212 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2783 

A DNA sequence (GASx1459R) was identified in S.pyogenes <SEQ ID 8065> which encodes the amino
acid sequence <SEQ ID 8066>. Analysis of this protein sequence reveals the following:

Possible site: 16

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -2.44 Transmembrane 82 - 98 ( 81 - 98)

Final Results

bacterial membrane --- Certainty=0.1977 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2784 

A DNA sequence (GASx1460R) was identified in S.pyogenes <SEQ ID 8067> which encodes the amino
acid sequence <SEQ ID 8068>. Analysis of this protein sequence reveals the following:

Possible site: 55

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3368 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2785

A DNA sequence (GASx1461R) was identified in S.pyogenes <SEQ ID 8069> which encodes the amino
acid sequence <SEQ ID 8070>. Analysis of this protein sequence reveals the following:

Possible site: 61

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2834 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2786 

A DNA sequence (GASx1462R) was identified in S.pyogenes <SEQ ID 8071> which encodes the amino
acid sequence <SEQ ID 8072>. Analysis of this protein sequence reveals the following:

Possible site: 27

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3531 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S. agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2787 

A DNA sequence (GASx1463R) was identified in S.pyogenes <SEQ ID 8073> which encodes the amino
acid sequence <SEQ ID 8074>. Analysis of this protein sequence reveals the following:

Possible site: 58

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2483 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB14569 GB:Z99117 similar to phage-related protein [Bacillus subtilis] 
Identities = 98/252 (38%), Positives = 152/252 (59%), Gaps = 29/252 (11%) 

Query: 16 SPAVKNRIEQWGARAEQFTTSLLSIIS^WIiIiAKATSESIMGAAMKAAVUILPIEPSLG 75 

SP+V R E+V+G RA QFT S+LS+ ++ +L K S++ +AM AA L+LPI+ +LG
Sbjct: 33 SPSVIKRFEEVLGKRATQFTASILSLYNSEQMLQKTDPMSVISSAMVAATLDLPIDKNLG 92

Query: 76 FAYWPYNRNYKDG>EWITVNEAQFQIGYRGL±QLAQRSGQVRNIEHGIIYEEEFLGYDK 135 

+A++VPY +AQFQ+GY+G IQLA R+GQ ++I I+E E ++ 

Sbjct: 93 YAWIVPYG GKAQFQLGYKGYIQLALRTGQYKSINCIPIHEGELQKWNP 140 

Query: 136 IRGQLKLTGDYVDSGWKGYFASLELISGFYKMIEWPKEKVYEHAKKYSKTFDKKTGDFK 195 

+ ++++ + +S V GY A ELI+GF K ++W K +V +H KK+SK+ DF
Sbjct: 141 LTEEIEIDFEKRESDAVIGYAAYFELINGFRKTVYWTKAQVEKHKKKFSKS DF- 193 

Query: 196 PGTPWATEFDPMAIKTLLKELLSKYAPLSVEMQDA-LEADNMSTIVIPICDVTPQE'raiSIi 254

W ++D MA+KT+LK +LSK+ LSVEMQ A +E D I D+T + +S 
Sbjct: 194 GWKNDWDAMALKTVLKAVLS KWGI L S VEMQKAVI EEDETRERI DITNEADSS- 245 

Query: 255 DDLIGTQNEKKD 266 

++I ++ KD 
Sbjct: 246 -EIIDSEPSNKD 256 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2788 

A DNA sequence (GASx1464R) was identified in S.pyogenes <SEQ ID 8075> which encodes the amino
acid sequence <SEQ ID 8076>. Analysis of this protein sequence reveals the following:

Possible site: 30

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4258 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2789 

A DNA sequence (GASx1465R) was identified in S.pyogenes <SEQ ID 8077> which encodes the amino
acid sequence <SEQ ID 8078>. Analysis of this protein sequence reveals the following:

Possible site: 51

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2045 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2790 

A DNA sequence (GASx1469R) was identified in S.pyogenes <SEQ ID 8079> which encodes the amino
acid sequence <SEQ ID 8080>. Analysis of this protein sequence reveals the following:

Possible site: 19

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2791 

A DNA sequence (GASx1470R) was identified in S.pyogenes <SEQ ID 8081> which encodes the amino
acid sequence <SEQ ID 8082>. Analysis of this protein sequence reveals the following:

Possible site: 37

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3577 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAC98430 GB:L29324 excisionase [Streptococcus pneumoniae]
Identities = 23/56 (41%), Positives = 41/56 (73%)

Query: 23 KHLIQQWEGLTVATAKQWATEMRDHPDFKQFVLNPTHRIVFIDYKGFKLFVQWKSR 78
K ++++W+GL T +W EMR++ F +V+NPTH++VFI+ +GF+ F++WK +

Sbjct: 19 KGILKRWDGLNKYTLNRWIKEMRENRTFSMYVINPTHKLVFINLEGFESFLRWKQK 74 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
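
The "Identities" figure reported for a hit is the number of alignment columns in which the query and subject carry the same residue. As an illustrative sketch (the helper names `count_identities` and `identity_line` are hypothetical), the count can be recomputed directly from the two aligned rows; the sequences used below are the ungapped Query and Sbjct rows of the excisionase alignment printed in Example 2791.

```python
def count_identities(query, subject):
    """Count columns where both aligned rows hold the same residue.

    The two rows must already be aligned, i.e. have equal length.
    Gap characters ('-') are never counted as identities.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


def identity_line(query, subject):
    """Build a middle line showing identical residues and blanks elsewhere."""
    return "".join(q if q == s and q != "-" else " " for q, s in zip(query, subject))


# Aligned rows copied from the alignment in Example 2791.
QUERY = "KHLIQQWEGLTVATAKQWATEMRDHPDFKQFVLNPTHRIVFIDYKGFKLFVQWKSR"
SBJCT = "KGILKRWDGLNKYTLNRWIKEMRENRTFSMYVINPTHKLVFINLEGFESFLRWKQK"

if __name__ == "__main__":
    print(count_identities(QUERY, SBJCT), "/", len(QUERY))
    print(identity_line(QUERY, SBJCT))
```

The "Positives" figure additionally counts conservative substitutions (the "+" columns), which requires a substitution matrix and is therefore left out of this sketch.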

Example 2792

A DNA sequence (GASx1473) was identified in S.pyogenes <SEQ ID 8083> which encodes the amino acid
sequence <SEQ ID 8084>. Analysis of this protein sequence reveals the following:

Possible site: 27

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2725 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2793

A DNA sequence (GASx1476) was identified in S.pyogenes <SEQ ID 8085> which encodes the amino acid
sequence <SEQ ID 8086>. Analysis of this protein sequence reveals the following:

Possible site: 23

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1422 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2794 

A DNA sequence (GASx1480R) was identified in S.pyogenes <SEQ ID 8087> which encodes the amino
acid sequence <SEQ ID 8088>. Analysis of this protein sequence reveals the following:

Possible site: 25

>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood = -4.04 Transmembrane 291 - 307 ( 290 - 309)

Final Results

bacterial membrane --- Certainty=0.2614 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2795 

A DNA sequence (GASx1489R) was identified in S.pyogenes <SEQ ID 8089> which encodes the amino
acid sequence <SEQ ID 8090>. Analysis of this protein sequence reveals the following:

Possible site: 23

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2278 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2796 

A DNA sequence (GASx1490R) was identified in S.pyogenes <SEQ ID 8091> which encodes the amino
acid sequence <SEQ ID 8092>:

SFITSVIAFRKLLKCEGIDLYL^GDLMTCFEQLLTQLK^ 

QPYKVFTPYYRIWQNYPKETPIKVELSQGRWLNLETPDDVLRTVESF^ 

SPFLRIGAIGIRTVYHAWQAPNSLGQATFLKEIAWRDFYIM 

QKTGSWHITOLRMIVASFLTKDLLCDWRIjGEQYFQQQLIDYDAASNIGGWQWAASTGTDAVPYFRIFNPVTQGKRFDPKGEFIECAYLPQLEH 
VPEKYLHEPWKMPKNLQESVSCIIGTDYPQPIVDHAKQREQAIAKYEWAKEKAKIE

Analysis of this protein sequence reveals the following: 

Possible site: 33

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAA22361 GB:M94110 DNA photolyase [Bacillus firmus] 
Identities = 175/338 (51%) , Positives = 228/338 (66%) , Gaps = 6/338 (1%) 

Query: 145 EIINQSGQPYKVFTPYYRIWQNYPKETP--IKVELSQGRWI^LETPDDVLRTVES--FKD 200 

+++ + G PYKVFTPYY+ W K TP IK ++ G PD T+ + K 

Sbjct: 2 QVIjKKDGTPYKVFTPYYKAWAKERKRTPAVIKRDVLLGSVHKGTAPDREAETIiFNNLIKK 61 

Query: 201 EKYQDVATFDE-ASKQLNRFIQDQLAAYHANRDFPAQLGTSRLSPFLRIGAIGIRTVY-H 258 

Y A +E A K+L F + +L+ Y ANRDFP+ GTSRLSP+++ GA+ R++Y H 
Sbjct: 62 CSYDWSAIGEEHAIKRLQMFTKKRLSGYKANRDFPSITGTSRLSPYIKTGAVSSRSIYYH 121 

Query: 259 ATOQAPNSLGQATFLKELAWRDFYNMVYVAYPDQKTQPIQKAFSQIEWVNNPDWFQLWKE 318 

+ +S TFLKELAWRDFY MV+ PD K + I + + ++ W ++ D WK 
Sbjct: 122 ILNAEADSYSAETFLKEIAWRDFYRMVHFYEPDCKDREIMEGYRELNWSHDQDDLTSWKR 181 

Query: 319 GKTGYPIVDAAMLQLQKTGWMHNRLRMIVASFLTKDLLCDWRLGEQYFQQQLIDYDAASN 378 

G+TG+PIVDA M QL GWMHNRLRMI ASFLTKDLL DWRLGE+YF++ LIDYD +SN 
Sbjct: 182 GETGFPIVDAGMRQLLNEGWMHNRLRMITASFLTKDLLIDWRLGERYFERMLIDYDPSSN 241 



Query: 379 IGGWQWAASTGTDAVPYFRIFNPVTQGKRFDPKGEFIKAYLPQLEHVPEKYLHEPWKMPK 438 

IGGWQWAAS GTDAVPYFRIFNPVTQ KRFD G +I+ Y+P+L HVP+ Y+HEPWKM +
Sbjct: 242 IGGWQWAASVGTDAVPYFRIFNPVTQSKRFDENGTYIRTYIPELNHVPDHYIHEPWKMSE 301

Query: 439 NLQESVSCIIGTDYPQPIVDHAKQREQAIAKYEWAKEK 476

Q C + DYP PIVDH+KQR++A++ E+ 
Sbjct: 302 EEQVKYKCRLDEDYPLPIVDHSKQRKKALSFFKGDDEE 339 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2797 

A DNA sequence (GASx1493R) was identified in S.pyogenes <SEQ ID 8093> which encodes the amino
acid sequence <SEQ ID 8094>. Analysis of this protein sequence reveals the following:

Possible site: 39

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2748 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2798 

A DNA sequence (GASx1501R) was identified in S.pyogenes <SEQ ID 8095> which encodes the amino
acid sequence <SEQ ID 8096>. Analysis of this protein sequence reveals the following:

Possible site: 31

>>> Seems to have a cleavable N-term signal seq.

80 ( 53 - 83)

Final Results

bacterial membrane --- Certainty=0.3909 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC95443 GB:AF068901 YlmG [Streptococcus pneumoniae] 
Identities = 35/81 (43%) , Positives = 58/81 (71%) 

Query: 1 MILILSILLRLIKVYTYLLIAYALMSWFPGAYDSKIGRLISGIVEPILKPFRAFNLQFAG 60 

MI ++ ++ + +Y+ +L+A+A+MSWFPGAY+S +GR I +V+P+L P + LQ AG 
Sbjct: 1 MIFLIRMIYNAVDIYSLILVAFAVMSWFPGAYESSLGRWIVALVKPVLAPLQRLPLQIAG 60 

Query: 61 LDFTIFWIISUJFLAQVLVR 81 

LD +++V 1+ + FL + LVR 
Sbjct: 61 LDLSVWVAIVLVRFLGENLVR 81 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2799 

A DNA sequence (GASx1502) was identified in S.pyogenes <SEQ ID 8097> which encodes the amino acid
sequence <SEQ ID 8098>. Analysis of this protein sequence reveals the following:

Possible site: 25

>>> Seems to have no N-terminal signal sequence

33 ( 17 - 33)

Final Results

bacterial membrane --- Certainty=0.1956 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2800 

A DNA sequence (GASx1507) was identified in S.pyogenes <SEQ ID 8099> which encodes the amino acid
sequence <SEQ ID 8100>. Analysis of this protein sequence reveals the following:

Possible site: 23

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0865 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2801 

A DNA sequence (GASx1511R) was identified in S.pyogenes <SEQ ID 8101> which encodes the amino
acid sequence <SEQ ID 8102>. Analysis of this protein sequence reveals the following:

Possible site: 47

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood =-11.83 Transmembrane 31 - 47 ( 22 - 53)
INTEGRAL Likelihood = -0.96 Transmembrane 2 - 18 ( 1 - 18)

Final Results

bacterial membrane --- Certainty=0.5734 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
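
Where a protein is predicted to span the membrane, the report lists one "INTEGRAL Likelihood = ... Transmembrane start - end ( start - end)" line per predicted segment. The following is a minimal sketch, not the original tool, of how these lines could be read into structured records; the `Segment` type and function names are hypothetical. It assumes, as is usual for this kind of score, that a more negative likelihood indicates a stronger membrane-spanning prediction.

```python
import re
from typing import NamedTuple


class Segment(NamedTuple):
    likelihood: float  # more negative is assumed to mean a stronger prediction
    start: int         # first residue of the predicted segment
    end: int           # last residue of the predicted segment


# Example line: "INTEGRAL Likelihood =-11.83 Transmembrane 31 - 47 ( 22 - 53)"
TM_RE = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?\d+\.\d+)\s+Transmembrane\s+(\d+)\s*-\s*(\d+)"
)


def parse_segments(lines):
    """Return a list of Segment records, one per matching report line."""
    segments = []
    for line in lines:
        match = TM_RE.search(line)
        if match:
            segments.append(
                Segment(float(match.group(1)), int(match.group(2)), int(match.group(3)))
            )
    return segments


def strongest(segments):
    """Return the segment with the lowest (most negative) likelihood."""
    return min(segments, key=lambda seg: seg.likelihood)


if __name__ == "__main__":
    report = [
        "INTEGRAL Likelihood =-11.83 Transmembrane 31 - 47 ( 22 - 53)",
        "INTEGRAL Likelihood = -0.96 Transmembrane 2 - 18 ( 1 - 18)",
    ]
    for seg in parse_segments(report):
        print(seg, "length", seg.end - seg.start + 1)
```

Lines that have lost their likelihood value in scanning do not match the pattern and are skipped rather than guessed at.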

Example 2802

A DNA sequence (GASx1516R) was identified in S.pyogenes <SEQ ID 8103> which encodes the amino
acid sequence <SEQ ID 8104>. Analysis of this protein sequence reveals the following:

Possible site: 42

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2729 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MTNTLVENIYASVTHNISKKEASKNEKTKAVINQAVADLSVAA.SIVHQVHWYM 60 

MTNT+ ENIYAS+ H + KKE S NEKTKAVLNQAVADLS AASIVHQVHWYMRG GFLY 
Sbjct: 1 MTNTITENIYASIIHQVEKKENSGNEKTKAVLNQAVADLSKAASIVHQVHWYMRGSGFLY 60

Query: 61 LHPKMDELLDSLNANLDEMSERLITIGGAPYSTIAEFSKHSKLDFAKGTYDKTVAQHLAR 120 

LHPKMDEL+D+LN +LDE+SERLITIGGAP+STL EF ++S+L+E GT+DK++ HL R 
Sbjct: 61 LHPKMDELMDALNGHLDEISERLITIGGAPFSTLKEFDENSRLEETVGTWDKSITDHLKR 120 

Query: 121 LVEVYLYLSSLYQVGLDITDEEGDAGTNDLFTAAKTEAEKTIWMLQAERGQGPAL 175 

LV+VY YLSSLYQVGLD+TDEE DA +ND+FTAA+TEA+KTIWMLQAE GQ P L 
Sbjct: 121 LVQVYDYLSSLYQVGLDVTDEEDDAVSNDIFTAAQTEAQKTIWMLQAELGQAPGL 175 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2803 

A DNA sequence (GASx1517) was identified in S.pyogenes <SEQ ID 8105> which encodes the amino acid
sequence <SEQ ID 8106>. Analysis of this protein sequence reveals the following:

Possible site: 46

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = [illegible] Transmembrane 109 - 125 ( [illegible] - 126)
INTEGRAL Likelihood = [illegible] Transmembrane 63 - 79 ( [illegible] )
INTEGRAL Likelihood = [illegible] Transmembrane 154 - 170 ( 151 - 176)
INTEGRAL Likelihood = [illegible] Transmembrane 189 - 205 ( 189 - 205)
INTEGRAL Likelihood = [illegible] Transmembrane 130 - 146 ( 127 - 147)
INTEGRAL Likelihood = [illegible] Transmembrane 6 - 22 ( 1 - 24)
INTEGRAL Likelihood = [illegible] Transmembrane 83 - 99 ( 83 - 101)

Final Results

bacterial membrane --- Certainty=0.3527 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>


No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAA96471 GB:AB036428 type IV prepilin peptidase homologue 
[Streptococcus mutans] 
Identities = 55/127 (43%) , Positives = 78/127 (61%) , Gaps = 3/127 (2%) 

Query: 83 VSASYCYLLLFSLLFSLFDWRSQEYPFILWLFSFVSLLLFYSINYLSLILLLLGLLAHLR 142 

++ S hh +L SL+D + Q YP LW+ L+ Y +N +SLIL L G+ A L+ 

Sbjct: 91 LTTSQVCLLFMGVLLSLYDLQDQSYPLTLWIGFTFLLMFIYPLNLISLILFLFGIFAALK 150 

Query: 143 PFSIGAGDFFYLASLALVLDLTSLW^IQLASLAGITACLLLGIKRIP — FIPYLSFGLF 200 

+ IG+GDFFYLA+LAL L+L +IW+IQ+ASL GI LL + p F+P+L G 
Sbjct: 151 NINIGSGDFFYLATLALSLNLQQIIWIIQIASLLGILYSLLFQKHKEPFAFVPFLFLG-H 209 

Query: 201 WIVLLEH 207 
Sbjct: 210 LIIIFSH 216

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2804 

A DNA sequence (GASx1538R) was identified in S.pyogenes <SEQ ID 8107> which encodes the amino
acid sequence <SEQ ID 8108>. Analysis of this protein sequence reveals the following:

Possible site: 15

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1186 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2805 

A DNA sequence (GASx1539R) was identified in S.pyogenes <SEQ ID 8109> which encodes the amino
acid sequence <SEQ ID 8110>. Analysis of this protein sequence reveals the following:

Possible site: 34

n signal seq

Final Results

bacterial membrane --- Certainty=0.5692 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAF31453 GB:AF221126 putative histidine kinase [Streptococcus pneumoniae] 
Identities = 141/301 (46%), Positives = 210/301 (68%), Gaps = 7/301 (2%) 

Query: 1 MKRYPLLVQLISYVFVIVI7ALITTLGLLYYQTSSRNIRQLIERDTRQSIRQSSQFIDAYI 60 

MKR LLV+++ +F++ + L+ +G YYQ+SS I IE +++ +1 Q+S FI +YI 
Sbjct: 1 MKRSSLLVRMVISIFLVFLILLALVGTFYYQSSSSAIEATIEGNSQTTISQTSHFIQSYI 60 

Query: 61 KPLKETTSVXAKNTEIQAFASQIHQENDKQXTjQLMI^WTLATI^SDLQAAVLVTKIJGRTVST 120 

K L+ T++ L + T++ A+A Q+ + + L +L ++ DL+ VLVTK G+ +ST 
Sbjct: 61 KKLETTSTGLTQQTDVmYAENPSQDKVEGIRDLFLTILKSDKDLKTWLVTKSGQVIST 120 

Query: 121 NSQLTMKTSSD^MAEPWYKAAIDRCAMPILTPARQLSLSSKKEWWSOTQEVVDRAGHNL 180 

+ + MKTSSDMMAE WY+ AI + AMP+LTPAR+ S +WV+SVTQE+VD G NL 
Sbjct: 121 DDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARK SDSQWVI S VTQELVDAKGANL 176 



Query: 181 GVLRLDlAYPTIKASLDQLQIiSRQGFAFI^roraEBVYHPKKSWSSSKEMAAMKPYLAI 240 
GVLRLDI+Y T++A L+QLQLG+QGFAFI+N+ HEFVYHP+ +VYSSS +M AMKPY+ 
Sbjct: 177 GVmLDISYETLEAYmQLQLGQQGFAFIINEifflEFVYHPQHTVYSSSSKMEflMKPYIDT 236

Query: 241 QNC3YTKDKTSFWQKLIPNSQOTLVGVMLDQLHRVQRQIFWSFSWNRASTLSDLWLCNCL 301 

GYT S+V Q+ I + WT++GV+SL++L +V+ Q+ W+ ++++ L +C CL 

Sbjct: 237 GQGYTPGHKSYVSQEKIAGTDWTVLGVSSLEKLDQVRSQLLWTL LGASVTSLLVCLCL 294 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
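
Each alignment in these examples is introduced by a summary line of the form "Identities = 141/301 (46%), Positives = 210/301 (68%), Gaps = 7/301 (2%)". The sketch below, in Python, shows one way to read such a line and recompute a percentage from its fraction. The function names are hypothetical. Integer truncation is an assumption that happens to reproduce the Identities and Gaps figures in the line quoted here; it is not a statement about how the search program computes them, and the Positives figures in this listing do not always agree with it.

```python
import re

# Matches e.g. "Identities = 141/301 (46%)", tolerating extra spaces.
FIELD_RE = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)\s*\(\s*(\d+)\s*%\s*\)"
)

def parse_summary(line):
    """Return {field: (count, aligned_length, printed_percent)}."""
    return {
        m.group(1): (int(m.group(2)), int(m.group(3)), int(m.group(4)))
        for m in FIELD_RE.finditer(line)
    }

def truncated_percent(count, length):
    """Percentage with the fractional part dropped (assumed convention)."""
    return (100 * count) // length
```

Comparing `truncated_percent(count, length)` with the printed percentage is a cheap way to flag summary lines whose digits may have been damaged in scanning.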

Example 2806 

A DNA sequence (GASx1542R) was identified in S.pyogenes <SEQ ID 8111> which encodes the amino
acid sequence <SEQ ID 8112>. Analysis of this protein sequence reveals the following:

Possible site: 39

>>> May be a lipoprotein

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC23101 GB:U32823 conserved hypothetical protein [Haemophilus influenzae Rd] 
Identities = 56/128 (43%), Positives = 87/128 (67%)

Query: 73 DFELKGIDGKTYRLSEFKGKKVYLKFWASWCSICLSTIJUJTEDLJUCMSDKDYVVLTWSP 132 

D +LK ++ + LS++KGK VY+K WASWC ICL+ LA+ +DL+ D+++ V+T+VSP 
Sbjct: 24 DVQLKDLNNQPVTLSQYKGKPVYVKMWASWCPICLAGLAEIDDIiSAEKDRNFEVITIVSP 83 

Query: 133 GHQGEKSFjADFKKWFQGTDYKDLPVLLDPDGKLLEAYGVRSYPTEVFIGSDGVLAKKHIG 192 

H+GEK ADF +W++G +YK++ VLLD G++++ VR YP +F+ SD L K G 
Sbjct: 84 DHKGEKDTADFIEWYKGLEYKNITVLLDEKGEIIDKARWGYPFNLFLDSDLNLKKTVPG 143 

Query: 193 YAKKSDIK 200 

+ 1+ 
Sbjct: 144 HLGAEQIR 151 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
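
In the alignments above, every "Query:" or "Sbjct:" row carries a start coordinate, a run of residues, and an end coordinate. For an undamaged row the number of residues (ignoring "-" gap characters) should equal end minus start plus one. The following Python is a minimal sketch of that consistency check, which can help locate rows damaged in scanning. The name `check_row` is hypothetical, and the check assumes the row is written as a single unbroken token; rows in which gaps were rendered as spaces are not recognised and return `None`.

```python
import re

# label, start coordinate, residues (no internal spaces), end coordinate
ROW_RE = re.compile(r"^(Query|Sbjct):\s*(\d+)\s+(\S+)\s+(\d+)\s*$")

def check_row(line):
    """Return (label, is_consistent), or None if the line is not a clean row."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    label = m.group(1)
    start, end = int(m.group(2)), int(m.group(4))
    # Gap characters do not consume a coordinate.
    residues = sum(1 for ch in m.group(3) if ch != "-")
    return label, residues == end - start + 1
```

For instance, the row "Query: 193 YAKKSDIK 200" holds eight residues for coordinates 193 to 200, so it passes the check.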



Example 2807

A DNA sequence (GASx1543R) was identified in S.pyogenes <SEQ ID 8113> which encodes the amino
acid sequence <SEQ ID 8114>. Analysis of this protein sequence reveals the following:

Possible site: 13

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial membrane --- Certainty=0.4100 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAC23102 GB:U32823 cytochrome C-type biogenesis protein
[Haemophilus influenzae Rd]
Identities = 106/224 (47%), Positives = 138/224 (61%), Gaps = 16/224 (7%)

Query: 6 VLMVSVFGAGLLSFFSPCIFPVLPVYLGILLDADDSKTITIFGKKLYWYGIVKTLaFIFG 65
+L+ +VF AGL SF SPCIFP++P+Y GIL GKK ++ T FI G
Sbjct: 6 LLIGTVFLAGLASFLSPCIFPIIPIYFGILSKG GKK VLNTFLFILG 51

Query: 66 LSTIFVILGYGAGFLGNILYAVWFRYLLGALVIILGIHQMGLITIKSLQFQKSLTFHNNK 125
LS FV LG+ GFLGNIL++ R + G +VIILGIHQ+G+ I L+ K + +
Sbjct: 52 LSLTFVSLGFSFGFLGNILFSNTTRIIAGVIVIILGIHQLGIFKIGLLERTKLVEIKTSG 111

Query: 126 IMGLFNAFILGLTFSFGWTPCVGPVLSSVIADVASGGNGAWQGGVLMIIYTLGLGIPFL 185
L AF+LGLTFS GWTPC+GP+L+SVLAL G+ A G +M +Y LGL PF+
Sbjct: 112

Query: 186

Sbjct:

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2808

A DNA sequence (GASx1544) was identified in S.pyogenes <SEQ ID 8115> which encodes the amino acid
sequence <SEQ ID 8116>. Analysis of this protein sequence reveals the following:

Possible site: 25

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1493 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 



Example 2809 

A DNA sequence (GASx1546R) was identified in S.pyogenes <SEQ ID 8117> which encodes the amino
acid sequence <SEQ ID 8118>. Analysis of this protein sequence reveals the following:

Possible site: 46

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4658 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB04061 GB:AP001508 unknown conserved protein in others 
[Bacillus halodurans] 
Identities = 48/89 (53%), Positives = 61/89 (67%)

Query: 1 ^mvLVTYDWTETPAGRKRLRHVAKLCVDYGQRVQNSVPECSVTPAEFVDIKHRLTQIID 60 

M+VL+TYDV T + G KRLR VAK C +YGQRVQNSVFEC V + +K LT +ID 
Sbjct: 1 MLVLITYDVQTSSMGGTKRLRKVAKACQNYGQRVQNSVFECIVDSTQLTSLKLELTSLID 60 

Query: 61 EKTDSIRFYLLGKNWQRRVETLGRSDSYD 89 

E+ DS+R Y LG N++ +VE +G S D 
Sbjct: 61 EEKDSLRIYRLGNNYKTKVEHIGAKPSID 89 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2810 

A DNA sequence (GASx1547R) was identified in S.pyogenes <SEQ ID 8119> which encodes the amino
acid sequence <SEQ ID 8120>. Analysis of this protein sequence reveals the following:

Possible site: 57 

10 N-terminal signal sequence 

60 ( 43 - 60) 



Final Results 

bacterial membrane --- Certainty=0.1680 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

RGD motif: 330-332 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB04060 GB:AP001508 unknown conserved protein in others 
[Bacillus halodurans] 
Identities = 162/341 (47%), Positives = 231/341 (67%), Gaps = 1/341 (0%)

Query: 1 MKKLLNTLYLTQEDFYVTKEGDNI VI KQEGICVLKRFPFRI IDGI VCFSYLGVSSALVKLC 60 

MKKLLNTLY+TQ D Y++ +GDN+V+ +E + L R P ++ IV F Y G S AL+ C 
Sbjct: 1 MKKLLirrLYVTQPDTYLSLDGDNVVLLKEQEKLGRLPLHNLEAIVGFGYT>FEATURESALMGYC 60 

Query: 61 TENQINLSFHTPQGRFCGRYIGSTNGNVLLRREHYRLSDRE-ESLEYAKRFILAKISNSR 119 

E I+++F T GRF R +G + GNV+LR+ YR+S+ + ES + A+ FI K+ NS+ 
Sbjct: 61 AERNISITPLTKNGRF^WGESRGNVVLRKTQYRISENDQESTKIARNFITGKVYNSK 120 

Query: 120 KYLLRFKRDHRQQIDTKLFEAVNDELIWALEMVQAADNKDSLRGIEGQAANQYFRIFNDL 179 

L R R+H +++ + F+A + L ++ ++ D+ +SLRG EGQAA Y ++F+ + 
Sbjct: 121 W^ERMTREHPI^VlWEQFKATSQLLSVMMQEIRNaJSLESLRGVffiGQAAINYNKVFDQM 180 

Query: 180 V1TDKKTFYFQGRSKRPPLDCVNALLSFGYSLLTFECQSALEAVGLDSYVGFFHTDRPGR 239 

+L K+ F F GRS+RPP D VNA+LSF Y+LL + +ALE VGLD+YVGF H DRPGR 
Sbjct: 181 ILQQKEEFAFHGRSRRPPKDNVNAMLSFAYTLLANDVAAALETVGLDAYVGFMHQDRPGR 240 

Query: 240 ASLALDLVEEFRSYIVDRFVFSLINKGQLQKKHFEVKENGSILLTENGRAIFIDLWQKRK 299 

ASLALDL+EE R DRFV SIiIN+ ++ F KENG++L+T+ R F+ WQ +K 
Sbjct: 241 ASIALDLMEELRGLYADRFVLSLINRKEMTADGFYKKENGAVLMTDEARKTFLKAWQTKK 300 

Query: 300 HTEVEHPFTKEKVKLMLLPYVQAQLIAKAIRGDJjESYPPFM 340 

++ HP+ EK+ L+PYVQA LLA+ +RGDL+ YPPF+ 
Sbjct: 301 QEKITHPYLGEKMSWGLVPWQALLLARFLRGDLDEYPPFL 341 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2811 

A DNA sequence (GASx1548R) was identified in S.pyogenes <SEQ ID 8121> which encodes the amino
acid sequence <SEQ ID 8122>. Analysis of this protein sequence reveals the following:

Possible site: 49

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2247 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB04059 GB:AP001508 unknown [Bacillus halodurans] 
Identities = 90/169 (53%), Positives = 111/169 (65%), Gaps = 1/169 (0%) 

Query: 45 LHTKADNPYIKEKRKELLVSRAMPISSAELGLSGIMDWEFYKDDQGVSLRGKRGKWLPK 104 

+H KAD P++KEKR L RAMPI S L +SGI DWEF +D +G+ L G G + 
Sbjct: 1 MHKKADQPFMKEKRGSKLTVRAMPIQSKNLQISGICDVVEFVQDSEGIELSGVSGSYKAF 60 

Query: 105 VVEyKRGKPKKDTRTJlVQLVAQTMCLEETLDCDINEGCLYYIISVNQRVIVPMTSALRQEV 164 

VEYKRGKPKK DIVQLVAQ MCLEE L C I++G L+Y+ + RV VP+T ALR +V 
Sbjct: 61 PVEYKRGKPKKGDEDIVQLVAQAMC1EEMLVCRIDKGYLFYNEIKHRVEVPITDALRDKV 120 

Query: 165 KELAAEMHEVYQSQMLPKAAYFKNCQLCSLVDICKPRLSKKTRSVSRYI 213 

++A EMH Y+++ PK C CSL 1C P+L K RSV RYI 

Sbjct: 121 VQMAKEMHHYYENRHTPKVKTGPFCNNCSLQSICLPKLMNK-RSVKRYI 168 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 



Example 2812

A DNA sequence (GASx1549R) was identified in S.pyogenes <SEQ ID 8123> which encodes the amino
acid sequence <SEQ ID 8124>. Analysis of this protein sequence reveals the following:

Possible site: 26

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1399 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB04058 GB:AP001508 unknown conserved protein in others 
[Bacillus halodurans]
Identities = 148/290 (51%), Positives = 190/290 (65%), Gaps = 19/290 (6%) 

Query: 6 ffiEHKIDFMVTLE VKEIUSANGDPIJSIGNMPRTDAKGYGVMSDVS I KRKIRNRLQDMGKS IF 65 

Query: 66 VQANERIEDDFRSLERRFSQH FTAKTPDKEIEEKANAL---WFDVRAFGQVFTYLK 118

VQ+++R D F+SL R + K + ++E A W DVR+FGQVF + 

Sbjct: 63 VQSDDRKADSFKSLRDRADSNPELWCI^KAKNASVDEFAKIACQEWMDVRSFGQVFAFKG 122 

Query: 119 K- -SIGVRGPVSISMAKSLEPIVISSLQITRSTNGMEAKNNSGRSSDTMGTKHFVDYGVY 176

S+GVRGPVSI A S++PI I S QIT+S N + RSSDTMG KH VD+GVY 

Sbjct: 123 SNLSVGVRGPVSIHTATSIDPIDIVSTQITKSVNSVTGDK— -RSSDTMGMKHRVDFGVY 179 

Query: 177 VLKGSINAYFAEKTGFSQEDAEAIKEVLVSLFENDASSARPEGSMRVCEVFWFTHSSKLG 236 

V KGSIN AEKTGF+ EDAE IK L++LFEND+SSARP+GSM V +V+W+ HSSKLG 
Sbjct: 180 VFKGSINTQLAEKTGFTNEDAEKI KRALI TLFENDS S SARPDGSMEVHKVYWWEHSSKLG 239 

Query: 237 NVSSARVFDLLEYHQSIEEKSTYDAYQIHLNQEKLAKYEAKGLTLEILEG 266 

SSA+V L+ + ++D Y + L YE GL +E+++G 

Sbjct: 240 QYSSAKVHRSLKIESKTDTPKSFDDYAVEL YELDGLGVEVIDG 282 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2813 

A DNA sequence (GASx1550R) was identified in S.pyogenes <SEQ ID 8125> which encodes the amino
acid sequence <SEQ ID 8126>. Analysis of this protein sequence reveals the following:

Possible site: 43

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2882 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MDFFTSLLKTYEKAELADL VDHQKR- -NNEPVLLPI YHTSLKSNGKNI I SVKLDKDGQFH 58 

M + L +TYE A L + K+ + E LLPI HT+ ++ IV LD+DG F 
Sbjct: 1 MSWLLHLYETYE-ANLDQVGKTVKKGEDREYTLLPISHTTQNAH 1 E VTLDEDGDFL 55 

Query: 59 KAEFMADKQMIIFPVTADSVARSGSHPAPHPLVDKFAYYSAEM GQIQ YDS 108 

+A+ + K+ + P T ++ +RSGS AP+PL DK +Y + + G+I+ +D+ 

Sbjct: 56 RAKALT-KESTLIPCTEEAASRSGSKVAPYPLHDKLSYVAGDFVKYGGKIKNQDDAPFDT 114 

Query: 109 FHKQLNNWID--YCEEGDVKKFLTFVQQFILKPEFJJTLILDSLIGPDYQHNQLKVTFCDA 166 

+ KL W+ Y E VK T++++ L+++L NQ+ + 

Sbjct: 115 YIKNLGEWANSPYATE-KVKCIYTYLKKGRLIEDLVDAGVLKL DENQQLIEKWEK 168 

Query: 167 TGKEKLIDLSACFLEFSIDQ FQGFKNESVSTF KALHQSYISFVEANRENLG 217 

+EL+AF +DQ FF ES+ K + S+ISF 

Sbjct: 169 RYEELLGEKPAIFSSGATDQASAFVRFNVFHPESIDDVWKDKEMFDSFISFYNDKLGEED 228 

Query: 218 ICNISGREEQLTDKH RGLMGNAKI I S VS -NKREAYKGRFRERED VFSVGYETSEKI 272 

IC ++G T4+H R AK+IS + N ++GRF+ + + YE S+K 

Sbjct: 229 ICFVTGNRLPSTERHAMKIRHAADKAKLISAITONSGFTFRGRFKTSREAVGISYEVSQKA 288 

Query: 273 HLMLK^LLENKNTSTWLGSSQYLINWFSDD-LraDSRLDIVSPIFDDGLEEDDDDDTPPV 331 

H LK+L+ ++ S + + W +D+ L + DV+ E + DDT + 

Sbjct: 289 HNALKWLIHRQSKSI DDRVFLVKSNDNSLVPNPDEDAVDIMKHANRELERDPDTGQI 345 

Query: 332 ITLATEDNKRIGKSFIKGQKLFANDATY YVAIIiNKTSNGRIALKYFRQLQASQLLT 387

A E K IG + +D Y ++ +L+ + GR+A+ Y+R L L 

Sbjct: 346 F- -AGEVKKAIGG YRSDLNYQPEVHILVLDSATTGRMAVLYYRSIiNKELYIiN 395 

Query: 388 NLNKWQETYSWESRSKFGKSRLRT PTFHDILNVSYGVDRDRFLELDNDNFKSDQIQ 443 

L W ++ +WE R + + + P DI +YG ++ D ++ 

Sbjct: 396 RLEAWHDSCAWEHRYRRDEKEFISFYGAPATKDIAFAAYGPRA SEKVIKDLME 448 

Query: 444 KLVASLIDGKPMPQSIVKKL GNNVKERHRYRKHWYQVEQVCLAILHK QNGEEFS 497 

+++ ++DG+ +P+ IV+ +N R+ W + + A++ K + EE+ 

Sbjct: 449 RMLPCIVDGRRVPKDIVRSAFQR&SNPVSMERWE--WEKTLSITCALIRKMHIEQKEEWG 506 

Query: 498 PMLDHTNQ1TOSYLFGRLIAIFELIETLRYGLDGNNNDRITNAERYWTAYTGQPTKLMMLL 557 

LD ++ +RSYJjFGRL1iA+ +++E G G + R TNA RY +Y+ P + + 
Sbjct: 507 VPLDKSSTDRSYLFGRLLAVADVLER GALGKDETRATNAIRYMNSYSKNPGRTWKTI 563 

Query: 558 ENKIKPYEEPLKLmRGSWMKLEKEKEEILELLNPLLETETMEKPLDYRFIFGYYAEKNY 617 

+ ++PY+ KL + ++ L K +EI + P + PL +++ G+Y+++ 
Sbjct: 564 QESLQPYQ- -AKLGTKATY- -IiSKLVDEIGDQFEP GDFNNNPLTEQYLLGFYSQRRE 616 

Query: 618 YYTKQNTEVTE 628 

Y K+ E + 
Sbjct: 617 LYKKKEEETNQ 627 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2814 

A DNA sequence (GASx1551R) was identified in S.pyogenes <SEQ ID 8127> which encodes the amino
acid sequence <SEQ ID 8128>. Analysis of this protein sequence reveals the following:

Possible site: 33

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3035 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB04056 GB:AP001508 unknown [Bacillus halodurans] 
Identities = 90/218 (41%), Positives = 127/218 (57%), Gaps = 7/218 (3%)

Query: 13 GQRALFTNPATKGGSERSSYSVPTRQALNGIVDAIYYKPTFTNIVTEVKVINQIQTELQG 72 

G ALFT+P TK G E+ SYSVPT QAIi GI ++IY+KPT ++ E++V+ IQ E +G 
Sbjct: 11 GDYALFTDPLTKIGGEKLSYSVPTYQALKGIA3SIYWKPTIVFVIDELRVMKPIQMESKG 70 

Query: 73 vRALLHDYSADLSWSYLSDWYLIKFHr^TWNEDRKDLNSDRLPAKHEAIMERSIRKGGR 132 

VR + + L++ +YL DV Y +K HF +N R DL DR KH +I++RS++ GGR 

Sbjct: 71 VRPIEYGGGNTIAHYTYLKDVHYQVKAHFEFNLHRPDIAFDRNEGKHYSILQRSLKAGGR 130 

Query: 133 PXIVFLGTRECLGLVDDISQEEYETTVSYYNGV-NIDLGIMFHSFAYPKDK-KTPLKSYFT 190 

RD+FLG REC G V + E+ + +Y+G LGMHFYP+ + L 

Sbjct: 131 RDIFLGARECQGYV APCEFGSGDGFYDGQGKYHLGTMVHGFNYPDETGQHQLDVRLW 187 

Query: 191 KTVMKNGVITFKAQSECDI VNTLSSYAFKA- - PEEIKS 226 

VM+NG IF +C IV + K P+ ++S 

Sbjct: 188 SAVMENGYIQFPRPEDCPIVRPVKEMEPKIFNPDNVQS 225 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2815 

A DNA sequence (GASx1552R) was identified in S.pyogenes <SEQ ID 8129> which encodes the amino
acid sequence <SEQ ID 8130>. Analysis of this protein sequence reveals the following:

Possible site: 53

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2770 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB04055 GB:AP001508 unknown conserved protein in others 
[Bacillus halodurans] 
Identities = 252/836 (30%), Positives = 404/836 (48%), Gaps = 90/836 (10%)

Query: 3 MILAHYDCKKDKKQSLDEHLWHVACSSRQEASIIGQGDVLFLIGLYHDLGKADRTFQD-- 60 

M +AH Q+L EHL VC+ ++ VLGL HDLGK F+D 

Sbjct: 1 MYIAHIREVDKVIQTLKEHLCGVQCLftETFGAKLRLQHVAGLAGLLHDLGKYTNEFKDYI 60 

Query: 61 KLLNNPNRHVDHSYAGAKYLCSI IGPHLKNRGVDKNERMTFNEMVGYVI SAHH 113 

Sbjct: 

Query: 114 GMYDLCYYFDDAEYYGFNKFKNR I NRDLDG YHYHED I KGYALKLEKKLCDYGYK- DLREL 172 

+Y N + R L+ +++ Y +E+ + + +L 

Sbjct: 116 SNLQ DYISPTIESNFliTRVLE KELPEYESAVERFFQEVMTEAELARY 162 

Query: 173 1DKAFDNYQQAMSSLNWQDKSEWDYYQSCMVRLYLSLLKNADILDTVNAYGLKISPMDKT 232 

+ KA D +Q + Q Y SC++ +AD +T + + + T 

Sbjct: 163 VAKAVDEIKQFTDNSPTQSFFLTKYIFSCLI DADRTNT-RMFDEQAREEEPT 213 

Query: 233 ERSFLKHSYLAAIEQKYASFGQPNNQ LNTIRTEIAERVKERGKRDSKGIYRLDLPTG 289 

+ L Y + AS + ++ +N +R+ ++E+ + R S GIY L +PTG 
Sbjct: 214 QPQQLFEHYHQQLLNHLASLKESDSAQKPINVLRSAMSEQCESFAMRPS-GIYTIiSIPTG 272 

Query: 290 AGKTNLSMRYAFHQLTOHDKSRFFYITPFLSVLEQNftSEIRKVTGD-LGVLEHHSNVVKQ 348 

GKT S+RYA ++K R YI PF +++EQNA E+R 4- GD +LEHHSNW+ 

Sbjct: 273 GGKTLASLRYALKHAQE YNKQRI IYIVPFCTII EQNAQEVRNI LGDDENI LEHHSNWED 332 

Query: 349 ANEDDDDKDSLLSA YLSDSWDSQWLTSMVQFFQTLFKTKSANLRRFSSLINSW 403 

+ D+ +D +++ D+WD ++ T++VQF + + N RR +L +SV+ 

Sbjct: 333 SENGDEQEDGVITKKERLRLARDNWDRPI I FTTLVQFLNVFYAKGNRNTRRLHNLSHSVL 392 

Query: 404 ILDEVQSLPIEWTLFmTMWMKVM3TTIVLCTATQPAYDSSEIDHRICYGGNLGELA 463 

I DEVQ +P + +LFN +NFL + +I+LCTATQP ++ + H + + 
Sbjct: 393 IFDEVQKVPTKCVSLFNEALNFLKEFAHCSILLCTATQPTLEN--VKHSLLKDRD G 446 



Query: 464 EIVELTIEEKQIFSRTELRKFDDSDQKVHLTDVINLILGEE - - 

EIV+ E + F R E+ D +DQ + + + E S L I NTKK V + 

Sbjct: 447 EIVQNLTEVSEAFKRVEI- -LDKTDQPMTlTORLAEWVRDEAPSWGSTLIIliNTKKWKDL 504 

Query: 521 YTMLIODMTDRPWQLSTNMCAQHRLDLIAKIKXELQNNIPIICISTQLIEAGVDTO 580 

Y L+ PV+ LST+MCA HR D + +1+ L+ P IC++TQLIEAGVDV F V 

Sbjct: 505 YEKLEG-GPLPVFHLSTSMOUUlRKDQLDEIRAriKEGTPFICVTTQLIEAGVDVSFKCV 563 

Query: 581 IRSYSG1DSIVQAAGRCNREGKRDKGQVTLVNLTNEEENISRLTEIKTKKEATESILHKI 640 
IRS +G+DSI QAAGRCNR G+ V +++ + EE +S+L EI+ +E ++L +

Sbjct: 564 IRSIAGLDSIAQAAGRCNRHGEEQLQYVyVID--HAEETLSKLKEIEVGQEIAGNVLARF 621 

Query: 641 GSPIDISTLN RDFFEYYYANNQGLMDYPLED NLS I YDYLSLNI YQTAN 688 

+ N R++F YYY-t- ++Y +++ + + N Y T 

Sbjct: 622 KKKAEKYEGNLLSQAftMREYFRYYTSKM3ANI>KryFVKEVDKDMTKLLMSHAVEKSYVTYY 681 

Query: 689 KKFKGK LKQAFKTAGAKMNLINNDMIGILVPYGEAEKKLAYLEELGVSHFLSAKD 743 

+K G L ++KTA +1+ + +VPYGE + +A L S + 

Sbjct: 682 QKNTGTHFPLLLNGSYKTAADHFRVIDQNTTSAIVPYGEGQDIIAQLN SGEW 733 

Query: 744 YQTIKSLLKELQPFTVNV- -RENDPLFE- -TTKSYLNGQILVLTSEYYDTERGVKY 795 

+ +LK+ Q +TVN+ +E D L + +L+G + L +Y + GV + 

Sbjct: 734 VDDLSKVLKKAQQYTVNLYSQEIDQLKKEGAIVMHLDGMVYELKESWYSHQYGVDF 789 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2816 

A DNA sequence (GASx1558) was identified in S.pyogenes <SEQ ID 8131> which encodes the amino acid
sequence <SEQ ID 8132>. Analysis of this protein sequence reveals the following:

Possible site: 16

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1050 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 



Example 2817 

A DNA sequence (GASx1563) was identified in S.pyogenes <SEQ ID 8133> which encodes the amino acid
sequence <SEQ ID 8134>. Analysis of this protein sequence reveals the following:

Possible site: 27

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1872 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2818

A DNA sequence (GASx1564R) was identified in S.pyogenes <SEQ ID 8135> which encodes the amino
acid sequence <SEQ ID 8136>. Analysis of this protein sequence reveals the following:

Possible site: 32

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2173 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2819 

A DNA sequence (GASx1566R) was identified in S.pyogenes <SEQ ID 8137> which encodes the amino
acid sequence <SEQ ID 8138>. Analysis of this protein sequence reveals the following:

Possible site: 43

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3486 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2820 

A DNA sequence (GASx1568) was identified in S.pyogenes <SEQ ID 8139> which encodes the amino acid
sequence <SEQ ID 8140>. Analysis of this protein sequence reveals the following:

Possible site: 26

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2711 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.



WO 02/34771 PCT/GB01/04789 
-2823- 

Example 2821 

A DNA sequence (GASx1569) was identified in S.pyogenes <SEQ ID 8141> which encodes the amino acid
sequence <SEQ ID 8142>. Analysis of this protein sequence reveals the following:

Possible site: 27

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2822 

A DNA sequence (GASx1576R) was identified in S.pyogenes <SEQ ID 8143> which encodes the amino
acid sequence <SEQ ID 8144>. Analysis of this protein sequence reveals the following:

Possible site: 28

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4042 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2823 

A DNA sequence (GASx1577R) was identified in S.pyogenes <SEQ ID 8145> which encodes the amino
acid sequence <SEQ ID 8146>. Analysis of this protein sequence reveals the following:

Possible site: 21

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3342 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB04515 GB:AP001509 unknown [Bacillus halodurans]
Identities = 36/104 (34%), Positives = 55/104 (52%)

Query: 2 HMGAWNTGlTOKILYTQESVTDDMIAKRDQSIKDAKESPILGFTVDTKVIKrELSNISNVM 61

+M ++ GN IL E D + + A SP LGF D+ ++TE++ ISNV 

Sbjct: 392 NMPSFAIGNQLI LKLYEDDPQDKMLAFEAFNESAI PSPALGFYFDSNPVRTE I AAI SNVT 451 

Query: 62 NRYKASIOTGTVDPDEALPKLIADLKGAGWDKVQKEVQKQLDDF 105 

+ + ++ G VDP+E LP L AG KV E+Q+Q D++ 

Sbjct: 452 SEFSPALLKGAVDPEEYLPLFNDKLNEAGLQKVIDEMQRQFDEW 495 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2824 

A DNA sequence (GASx1578R) was identified in S.pyogenes <SEQ ID 8147> which encodes the amino
acid sequence <SEQ ID 8148>. Analysis of this protein sequence reveals the following:

Possible site: 27

>>> May be a lipoprotein

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB04515 GB:AP001509 unknown [Bacillus halodurans] 
Identities = 134/346 (38%), Positives = 206/346 (58%), Gaps = 10/346 (2%)



Query: 21
ANNY +NA++GAF -l
Sbjct: 42

Query: 81
Sbjct: 101

Query: 137
Sbjct: 161

Query: 197
Sbjct: 221

Query: 257
Sbjct: 278

Query: 317
Sbjct: 336

DGKLYA P + V Q +LSFN ELV+K+ LD+S++ S AD E +L

AAT+T+ +PLE WF+R+E P Y + I T AG +1 +RPL +P

+S+ SKN E+A+ L+LLNSDP L N L G+EG +E++ D 1



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2825

A DNA sequence (GASx1582) was identified in S.pyogenes <SEQ ID 8149> which encodes the amino acid
sequence <SEQ ID 8150>. Analysis of this protein sequence reveals the following:

Possible site: 34

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0454 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2826 

A DNA sequence (GASx1584R) was identified in S.pyogenes <SEQ ID 8151> which encodes the amino
acid sequence <SEQ ID 8152>. Analysis of this protein sequence reveals the following:

Possible site: 41

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3105 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

RGD motif: 3-5 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAG21428 GB:AF307332 meningioma-expressed antigen 5s splice 
variant [Homo sapiens] 
Identities = 94/271 (34%), Positives = 148/271 (53%), Gaps = 14/271 (5%)

Query: 120
G++EGFYG PW E+R + R + +NTY+YAPKDD R WR++Y + L
Sbjct: 63

Query: 180
f A+E ++F Y ISPGLD
Sbjct: 123

Query: AVERRFKKTAYAQAHIATEVHHFLNQQHAAPELVICPTE YDNHHDSIYLQELSE 293
A + F A+AQ + E++ +L + + CPTE Y N S YL+ + E
Sbjct: 183 ADKEVFSSFAHAQVSITNEIYQYLGEPET FLFCPTEYCGTFCYPNVSQSPYLRTVGE 239

Query: 294
- +R +IWDNI NDY D 4
Sbjct: 240

Query: 354
++KG+++NP +E 4
Sbjct: 298

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2827 

A DNA sequence (GASx1585R) was identified in S.pyogenes <SEQ ID 8153> which encodes the amino
acid sequence <SEQ ID 8154>. Analysis of this protein sequence reveals the following:

Possible site: 24

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4469 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2828 

A DNA sequence (GASx1587) was identified in S.pyogenes <SEQ ID 8155> which encodes the amino acid
sequence <SEQ ID 8156>. Analysis of this protein sequence reveals the following:

Possible site: 47

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3082 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB04509 GB:AP001509 unknown conserved protein in others 
[Bacillus halodurans] 
Identities = 221/425 (52%), Positives = 296/425 (69%), Gaps = 4/425 (0%)

Query: 12 RPIPTSVSQFMAKVESLCGDQHPDKALNFKTSFTNTLETTLKTYEDGTSFLLTGDIPAMW 71 

+ IP S+ +A+V++ D L F+ F NT TT++ E GT F++TGDIPAMW 

Sbjct: 4 KKIPRSLQAIIAQVKAHYADDQELQTL-FEQCFLNTYLTTIQEDEQGT-FVVTGDIPAMW 61 

Query: 72 LRDSTAQMKPyLFIoAKEDEEIRKIIAGLVKRQFRYICIDPYANAFNEEANEKGHQTDHTQ 131 

LRDS+AQ++PYL + KED ++ ++I G+++RQ+RYI DPYANAFN+ AN++GHQ D T+ 
Sbjct: 62 LFJDSSAQWPYLTWKEDMMAP^IKGVIERQTOYILHDPYANAMOTANKQGHQQDRTE 121 

Query: 132 mPWIWERKYEIDCLCTPIQIAYIiVRETGSTDQFNDDFHRGVELILDLWrVEQDH-AQS 190 

M+P +WERKYE+D LCYPIQLAYL ++ TG + +E I +W +EQDH A+S 

Sbjct: 122 MSPLVWERKYELDSLCYPIQIAYLYWKATGDDSVLQPTLKQVLETIYRIWKIEQDHEAKS 181 

Query: 191 PYLFERDTWRKEDTLTHAGKGSPVAPTGMTWSGFRPSDiaCQYGYLIPSNMFAVVVLSYL 250 

Y FERD R DTL GKG PTGMTWSGFRPSDDAC YGYLIP+NMFAWV +Y 
Sbjct: 182 SYSFERDDCRVSDTLLRKGKGGYSVPTGMTWSGFRPSDDACLYGYLIPANMFAVWSNYA 241

Sbjct: 242 VELL1AM-EEIKLAEEFRELEADIRQGIGQYGKMDHPVYGEIYVYETDGNGRVNLMDDAN 300

Query: 311 IPSIjLAAPYLGFCTKDDPIYLATRRTILSQENPYYYQGNAflAGIGSSHTPENYIWHIALA 370 

+PSLLA PYLG+ T DDP+Y TRR IIjS++NPYYY+G+ A G+GS HTP++Y+WHI+LA 
Sbjct: 301 VPSLLAIPYLGYTTADDPVYQNTRRFILSRDNPYYYEGSYAKGVGSPHTPDHYVWHISIA 360 

Query: 371 LQGLTALDQDSKKEMLDLLVATDAGTHLMHEGPDVNDPYQYTREWFSWANMMFCELLLDY 430 

+QG+TA+D KK+++ + T A T+ MHEGFDV+ P QYTR WF+WAN MF E LL 
Sbjct: 361 IQGMTAIDSKEKKQIVAMFKQTHADTYFMHEGFDVDRPEQYTRSWFAWANSMFSEFLLSE 420 

Query: 431 LGFSI 435 
G + 

Sbjct: 421 AGIYV 425 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.
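
Each database hit above is summarised by a line of counts over the aligned length (Identities, Positives and, where present, Gaps). The following is a minimal sketch, assuming Python, that extracts those counts and recomputes each fraction from the raw numbers rather than relying on the printed percentage. The function name is hypothetical.

```python
import re

# Matches "Identities = 221/425", "Positives = 296/425", "Gaps = 4/425".
_FIELD = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)")


def parse_alignment_summary(line):
    """Return {field: (count, aligned_length, fraction)} for one summary line."""
    summary = {}
    for name, count, length in _FIELD.findall(line):
        count, length = int(count), int(length)
        summary[name] = (count, length, count / length)
    return summary
```

For the hit in this Example, the Identities entry would hold 221 matches over an aligned length of 425.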

Example 2829 

A DNA sequence (GASx1588) was identified in S.pyogenes <SEQ ID 8157> which encodes the amino acid
sequence <SEQ ID 8158>. Analysis of this protein sequence reveals the following:

Possible site: 21

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.5250 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:BAB04508 GB:AP001509 unknown conserved protein in others
(divided) [Bacillus halodurans]
Identities = 312/737 (42%), Positives = 426/737 (57%), Gaps = 21/737 (2%)



Query: 183 ILGLLFANWYSNGNEIPTTEAFARLFWDKKLADAERFASTKHLLMMNGCDHQPVQLDVTK 242 

++G+L ANWYSNGNEIPT E EA+ FW KKL DAERFAST LL MNGCDHQPVQ DVT+ 
Sbjct: 63 VIGILLANWYSNGNEIPTDEDEAQTFWKKLRDAERFASTSQLLFMNGCDHQPVQKDVTQ 122 

Query: 243 AIALANQLYPDYEFVHSCFEDYIADLADDLPENLSTVQGEITSQETDGWYTLANTASARI 302 

AI +A L+PD F HS F DYL + ++LP+ L + GE+ +Q+TDGW TL NTASARI 
Sbjct: 123 AIKVAETLFPDVAFKHSNFHDYLTQIKEELPKELQKITGELRNQKTDGWSTLVNTASARI 182 

Query: 303 YLKQANTRVSRQLENITEPLAAMAYEVTSTYPHDQLRYAWKTLMQNHPHDS1CGCSVDSV 362 

YLKQAN R L N+ EP+ + + D Y WK LM+NHPHDSICGCS+D+V 

Sbjct: 183 YLKQANDRCQTLLTNVLEPMCLLV — ENKSLHRDFSEYYWKLLMENHPHDSICGCSIDAV 240 

Query: 363 HREmTRFEKAYEVGHYLAKEAAKQIAimiDTRDFPh©SQPFVLFNTSGHSKTSVAELSL 422 

HREM TRFEK E K+IA I+T ++ P V+ T+G S V + 

Sbjct: 241 HREMKTRFEKVEAGATTFIAECjGKEIAftQII^LHDSEFAIPLVVLKTNGTSGKRVVRHKV 300 

Query: 423 TWKKYHFGQRFPKEVYQEAQEYLARLSQSFQirDTSGQVRPEAEILGTSIAFDYDLPKRS 482 

KK +F + ++ + L + ++ + E+ + F YDLP+ 

Sbjct: 301 AMKKIYFDEM DFRHIPDRLKEIVMPTYRLEFPNKGSVPIEVQDAGVRFGYDLPRDG 356 

Query: 483 FREPYFAIKVRLRLPITLPAMSWKTIjALKLG NETTPSETVSLYDDSNQCLENGF 536 

FR PY+A L+T S h + G +T+ +DS LEN 

Sbjct: 357 FRRPYYA RELEVTFSYDSDLYLGYECGFLVPVEEKQTEARKELIGDPSMNTLENEA 412

Query: 537 LKVMIQTDGRLTITDKQSGLIYQDLLRFEDCGDIGITEYISRQPNHDQPFYADQGTIKLNI 596
+KVMI +G +1 DK +G Y+ L +ED GDIGNEY+ + + + + +1
Sbjct: 413 MKUMIHRNGSYSILDKTTGFEYRHLGIYEDVGDIGNEYIvIFKASSDGVRYTTEACEASIRI 472

[Query and Sbjct rows of the remaining alignment blocks (Query 597, 657, 717, 777, 836; Sbjct 473, 533, 592, 650, 710) are not legible in the source; the surviving match lines are:]

A +EI QT ++P +AD+ L+ E
+R+A RS4E+ ++TL T + +E+
DN +HR+R LF? +H ADSI+E V+RPN PD W+NP+
VSL +GE G+TI GL+EYEI+ D +IA+TLLRSVGE+GDWG F TPEAQC G

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.



Example 2830 

A DNA sequence (GASx1589R) was identified in S.pyogenes <SEQ ID 8159> which encodes the amino
acid sequence <SEQ ID 8160>. Analysis of this protein sequence reveals the following:

Possible site: 31

>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood = -11… Transmembrane 203 - … ( 195 - 221)
INTEGRAL Likelihood = -8… Transmembrane 61 - …
INTEGRAL Likelihood = -3… Transmembrane 107 - …
INTEGRAL Likelihood = -3… Transmembrane 39 - …
INTEGRAL Likelihood = -2… Transmembrane 129 - …
INTEGRAL Likelihood = -2… Transmembrane 89 - …

Final Results

bacterial membrane --- Certainty=0.5522 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S. agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAC10175 GB:AJ278302 histidine kinase [Streptococcus pneumoniae] 
Identities = 114/432 (26%) , Positives = 219/432 (50%) , Gaps = 10/432 (2%) 

Query: 21 LTLKLFSFVSAIPLRLKNIFYLSLSMVLFQVVFWAFFPDHFILDVVMLAQF LFFALI 77 

L + +F V I L + IF L +L WF +++ V L+ F L+ + 

Sbjct: 16 LKIVIFFKVDGISLTFERIFKAFIjFKILLAVVFGML GYMVGNVYLSYFMEPLYGIGL 72 

Query: 78 ALYYGKSIKAKFLMFYAFFPLVSISLVKRFIWFV[ffiLFGMPYSWKHNTLLIYSITCFS 137 

+ + + K L+]?Y FP++ ++L R + +FV+P G V + + I F+ 
Sbjct: 73 SFLLLRELPKKLLLFYGLFPMILvNLFYRGVSYFVLPFLGQ<3-QVYDDYSFIWLCIIIFN 131 

Query: 138 IFLIYRCIQVBHFDFSTWRQYFQSHRaSKLLVPTTNSSMALYYLCVQGIDVMSPSLSGLAT 197 

F+ ++ +DF++ R+ K L N M YYL +Q + G+ + 

Sbjct: 132 FFISLAFLKWLDYDFTSLRKGIIJ3KDFQ1SLTQISWIMGAYYLVIQNLSYFEYE-QGIQS 190

Query: 198 TTARSI I vTjFYFILFLTLIjXHLERYVKQNS IEAIVQQKE - - YRELINYSQHLGLLYQDIQ 255

TT R +I++FY + F+ ++ h+ Y+K E + Q+++ YRE+ YS+H+ LY++++ 
Sbjct: 191 TTVRHLILVFYLLFFMGIIKKLDTYLKDKLHERLNQEQDLRYREMERYSRHIEELYKEVR 250 

Query: 256 ELRRLLTWSSPJLKIGIEQITOISIvRLTYEGIIiNAEKN^ 315 

R T + + L++GIE+ D+ ++ Y+ +L +D++ DL L ++ A++ 

Sbjct: 251 SFRHDYTNLLTSLRLGIEEEDMEQIKEIYDSVLKDSSEKLQDNKYDLGRLVNVRDRMjKS 310 

Query: 316 IVIJUCDIEAKNKKLKVEVSIPNCIATFFLEVVDFTKLLSFLLDNAIEMSLETKQPCLSIA 375 

++ K I+A++K + V +P I + ++DF ++S L DNAIE S+E QP +SIA 
Sbjct: 311 LLAGKFIKARDKNIVFNVEVPEEIQVEGVSLLDFI.TVVSILCDNAIEASVEACQPHVSIA 370 

Query: 376 FLDQNHKLVIVIQSSTKQGQDDSQSVFAIPALKKRDDWQFDLRNVTTILNRYDYLTISSQ 435 

F + +1++S K+ D +F+ A K ++ L V 1+ + ++++ 

Sbjct: 371 FFKNGAQETFI IENS IKEEGIDISEIFS FGAS S KGEERGVGLYTVMKI VESHPNTSLNTT 430 

Query: 436 IHDGILTQLIEI 447 

D + Q++ + 
Sbjct: 431 CQDHVFRQVLTV 442 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2831 

A DNA sequence (GASx1593R) was identified in S.pyogenes <SEQ ID 8161> which encodes the amino
acid sequence <SEQ ID 8162>. Analysis of this protein sequence reveals the following:

Possible site: 25

Final Results

bacterial membrane --- Certainty=0.1510 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2832 

A DNA sequence (GASx1594) was identified in S.pyogenes <SEQ ID 8163> which encodes the amino acid
sequence <SEQ ID 8164>. Analysis of this protein sequence reveals the following:

Possible site: 61

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = … Transmembrane … - 92 ( 76 - 92)

Final Results

bacterial membrane --- Certainty=0.2572 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAF61313 GB:U96166 unknown [Streptococcus cristatus]
Identities = 31/66 (46%), Positives = 40/66 (59%), Gaps = 2/66 (3%)

Query: 14 LLGRILSKYVGRLTSCIENETTKIEMISRQl^'IGLtraLLGNLKTVHNPEIILKTINVYS 73
+ G +SK + + E K+ ++ ND IG N LLG+LKTVHNPEII + VYS
Sbjct: 30 VFGMDVSKXSSEVAILVNGE--KVHGYTILNDAIGFNRLLGDLKTVHNPEIIFEATGVYS 87

Query: 74 RRLQVF 79
RRLQ F
Sbjct: 88 RRLQAF 93

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2833

A DNA sequence (GASx1598) was identified in S.pyogenes <SEQ ID 8165> which encodes the amino acid
sequence <SEQ ID 8166>. Analysis of this protein sequence reveals the following:

Possible site: 14

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2117 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2834 

A DNA sequence (GASx1608) was identified in S.pyogenes <SEQ ID 8167> which encodes the amino acid
sequence <SEQ ID 8168>. Analysis of this protein sequence reveals the following:

Possible site: 16

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2835

A DNA sequence (GASx1619) was identified in S.pyogenes <SEQ ID 8169> which encodes the amino acid
sequence <SEQ ID 8170>. Analysis of this protein sequence reveals the following:

Possible site: 36

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2916 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2836 

A DNA sequence (GASx1621) was identified in S.pyogenes <SEQ ID 8171> which encodes the amino acid
sequence <SEQ ID 8172>. Analysis of this protein sequence reveals the following:

Possible site: 33

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1899 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

[hit description not legible in the source] … = 153/211 (72%)

Query: 7 ECEITIKEaVAHVKDGDTIIWGGFMTOGTPEKLIDALVEKGVKDLTLICNDAGFPDKGVGK 66
K +T+++A +DG TIMVGGFM GTP +L++AL+E GV+DLTLI ND F D G+G
Sbjct: … KLMTLQDATGFFRDGMTIMVGGFMGIGTPSRLVEALLESGVRDLTLIANDTAFVDTGIGP 63

Query: 67 MVANKQFSTIIASHIGLNREAGRQMTEGETVIDLVPQGTLAERIRSGGFGLGGFLTPTGI 126
++ N + +IASHIG N E GR+M GE + LVPQGTL E+IR GG GLGGFLTPTG+
Sbjct: 64 [sequence not legible in the source]

Query: 127 [sequence not legible in the source]
r +GK+ +T+DGK +LLE+PL+AD+ALI A++ D GNL Y S NFN ++A
Sbjct: 124 [sequence not legible in the source]

Query: 187 [sequence not legible in the source]
T+VE E+V+ G++ P+ + TPG
Sbjct: 184 [sequence not legible in the source]

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2837

A DNA sequence (GASx1622) was identified in S.pyogenes <SEQ ID 8173> which encodes the amino acid
sequence <SEQ ID 8174>. Analysis of this protein sequence reveals the following:

Possible site: 44

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4668 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAD54948 GB:AF157306 acetoacetate:butyrate/acetate coenzyme A
transferase [Clostridium beijerinckii]
Identities = 121/214 (56%), Positives = 161/214 (74%), Gaps = 5/214 (2%)

Query: 7 V1SKEEIQTRIAKRVAQELEHOTLVNLGIGLPTKVANYIPEGVTITLQSENGFVGLTGLT 66
VL+KE I AKRVA+EL+ LVNLGIGLPT VANY+P+ + IT +SENG VG+ +
Sbjct: 6 VLAKEII AKRVAKELKKGQLVNLGIGLPTLVANYVPKEMNITFESENGMVGMAQMA 61

Query: 67 DD-HYDPTIVNAGGQPVSIAPGGAFFDSSTSFGIIRGGHVAATVLGALQVDKEASIANYL 125
DP I+NAGG+ V++ P GAFFDSSTSF +IRGGKV VLGAD+VD+E ++AN++
Sbjct: 62 SSGENDPDIINAGGEYVTLLPQGAFFDSSTSFALIRGGHVDVAVLGALEVDEEGNIiANWI 121

Query: 126 IPGHWPGMGGAMDLLVGAKKVIVAMEHTNKGKAKILDKCTLPLTAQNVVNLIITEMGVF 185
+P K+VPGMGGAMDL +GAKK+IVAM+HT KGK KI+ KCTLPLTA+ V+LI+TE+ V
Sbjct: 122 VPNKIVPGMGGAfCiIAIGAKKIIvAMQHTGKGKPKIVKKCTLPLTAKAQVDLIVTEriCVI 181

Query: 186 EYQDEGLCALEINPDYTFEDVQNVTEVTLIDKTN 219
+ ++GL EI+ D T ++++ +T+ LI N
Sbjct: 182 DVTNDGLLFREIHKDTTIDEIKFLTDADLIIPDN 215

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2838 

A DNA sequence (GASx1628R) was identified in S.pyogenes <SEQ ID 8175> which encodes the amino
acid sequence <SEQ ID 8176>. Analysis of this protein sequence reveals the following:

Possible site: 17

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1243 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2839

A DNA sequence (GASx1639R) was identified in S.pyogenes <SEQ ID 8177> which encodes the amino
acid sequence <SEQ ID 8178>. Analysis of this protein sequence reveals the following:

Possible site: 34

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -8.65 Transmembrane 55 - 71 ( 44 - 73)
INTEGRAL Likelihood = -7.64 Transmembrane 13 - 29 ( 5 - 31)

Final Results

bacterial membrane --- Certainty=0.4461 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2840 

A DNA sequence (GASx1643) was identified in S.pyogenes <SEQ ID 8179> which encodes the amino acid
sequence <SEQ ID 8180>. Analysis of this protein sequence reveals the following:

Possible site: 35

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0766 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2841

A DNA sequence (GASx1645R) was identified in S.pyogenes <SEQ ID 8181> which encodes the amino
acid sequence <SEQ ID 8182>. Analysis of this protein sequence reveals the following:

Possible site: 18

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2842 

A DNA sequence (GASx1649R) was identified in S.pyogenes <SEQ ID 8183> which encodes the amino
acid sequence <SEQ ID 8184>. Analysis of this protein sequence reveals the following:

Possible site: 24

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0931 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2843 

A DNA sequence (GASx1650) was identified in S.pyogenes <SEQ ID 8185> which encodes the amino acid
sequence <SEQ ID 8186>. Analysis of this protein sequence reveals the following:

Possible site: 14

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.5678 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2844

A DNA sequence (GASx1651R) was identified in S.pyogenes <SEQ ID 8187> which encodes the amino
acid sequence <SEQ ID 8188>. Analysis of this protein sequence reveals the following:

Possible site: 40

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2761 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2845 

A DNA sequence (GASx1667R) was identified in S.pyogenes <SEQ ID 8189> which encodes the amino
acid sequence <SEQ ID 8190>. Analysis of this protein sequence reveals the following:

Possible site: 33

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2967 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2846

A DNA sequence (GASx1672) was identified in S.pyogenes <SEQ ID 8191> which encodes the amino acid
sequence <SEQ ID 8192>. Analysis of this protein sequence reveals the following:

Possible site: 24

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -3.82 Transmembrane 3 - 19 ( 1 - 20)

Final Results

bacterial membrane --- Certainty=0.2529 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2847 

A DNA sequence (GASx1673R) was identified in S.pyogenes <SEQ ID 8193> which encodes the amino
acid sequence <SEQ ID 8194>. Analysis of this protein sequence reveals the following:

Possible site: 38

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -8.86 Transmembrane 51 - 67 ( 47 - 75)
INTEGRAL Likelihood = -5.20 Transmembrane 27 - 43 ( 24 - 45)
INTEGRAL Likelihood = -3.66 Transmembrane 112 - 128 ( 112 - 131)

Final Results

bacterial membrane --- Certainty=0.4545 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAF41294 GB:AE002440 conserved hypothetical protein [Neisseria
meningitidis MC58]
Identities = 61/148 (41%), Positives = 96/148 (64%)

Query: 1 LKKSITNEKAILAQGGQEFGAQNTKFLTLLHIMIWFAVIFALLKQIKFDGISFLGLLLM 60 

L SI +EKA++A+G +++G N+ L +H + Y+ + L F+GIS +G L + 
Sbjct: 19 IiAVSIKHEKALIAKGAKQYGKTNSTLLAAVHTLYYIACFVWVWLSDTAFNGISLIGTLTV 78 

Query: 61 LLSVAVLYEVTRILGDIWIVKLMIAKDHKYVDffl^LFKTIKHPNYFIjNIAPEIiVGIALLCH 120 

+ S +L + + LG+IWTVK+ + +H+ WLFKT +HPNYFLNI PEL+GIALLC 
Sbjct: 79 RASFVILSLIIKQLGEIWTVKIYILPNHQINRSWLFKTFRHPNYFLNIIPELIGIALLCQ 138 

Query: 121 AKITAMLLFPCYIWIYLRIREENKLLA 148 

A ++ P Y++V++ RIR+E + +A 
Sbjct: 139 AWYVLLIGLPIYLLVLFKRIRQEEQAMA 166 

A related GBS gene <SEQ ID 9009> and protein <SEQ ID 9010> were also identified. Analysis of this
protein sequence reveals the following:

Lipop: Possible site: -1 Crend: 0
McG: Discrim Score: 5.86
GvH: Signal Score (-7.5): 0.14
Possible site: 60

>>> Seems to have a cleavable N-term signal seq.

ALOM program count: 2 value: -8.23 threshold: 0.0
INTEGRAL Likelihood = -8.23 Transmembrane 69 - 85 ( 64 - 89)
INTEGRAL Likelihood = -3.29 Transmembrane 142 - 158 ( 140 - 159)
PERIPHERAL Likelihood = 1.70 123
modified ALOM score: 2.15

*** Reasoning Step: 3

Final Results

bacterial membrane --- Certainty=0.4291 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



The protein has homology with the following sequences in the databases:

42.1/64.0% over 168aa

EGAD|177248| conserved hypothetical protein {Neisseria meningitidis} Insert characterized
GP|7379797|emb|CAB84365.1| |AL162755 putative integral membrane protein {Neisseria meningitidis} Insert characterized
GP|7226121|gb|AAF41294.1| |AE002440 conserved hypothetical protein {Neisseria meningitidis MC58} Insert characterized
PIR|F81147|F81147 probable integral membrane protein NMA1102 [imported] - Neisseria meningitidis (group B strain MD58, group A strain Z2491) Insert characterized

ORF00432(301 - 807 of 1140)
EGAD|177248|NMB0883(1 - 169 of 169) conserved hypothetical protein {Neisseria meningitidis}GP|7379797|emb|CAB84365.1| |AL162755 putative integral membrane protein {Neisseria meningitidis}GP|7226121|gb|AAF41294.1| |AE002440 conserved hypothetical protein {Neisseria meningitidis MC58}PIR|F81147|F81147 probable integral membrane protein NMA1102 [imported] - Neisseria meningitidis (group B strain MD58, group A strain Z2491)

%Match = 19.0
%Identity = 42.0 %Similarity = 53.9
Matches = 71 Mismatches = 61 Conservative Sub.s = 37

237 267 297 327 357 387 417 447

SSGEYHLLTSDHSLV*IGKAXX*LIXXXEFTMSIIIGLMAAMFIIRLAYLKi^

==: Nil =1 :|| HIM 111=11 1 = =1
MTMILSILSLFFIIRLLFIAVS1KHEKALIAKGAKQYGKTNSTLLAAVH
10 20 30 40

477 507 537 567 597 627 657 687

IIIYFSSOTiaiLTKASFNFVSVIGLSLMIFSVFMLHTVTRLLGRIWTO^
, |:s |s :|| :|:|| :: I :| 11 11111= III 111= HIIIIIII II

TLYYIACFVWVWLSDTAFNGISLIGTLTVMASFVILSLIIKQIXSEIWTVKIYILPmQINRSWLFKTFRHPJTOFLNIIPE
60 70 80 90 100 110 120

717 747 777 807 837 867 897 927

LLGVTLLCHAKYTALFVLPIYAFVIYLRIREENLLLKTIIIFlCGIiaCSRVY*E*DK**T*KSFFVILSQ*EEVFISCFFS
|:|: 111 = 1 I I llll H = : 11M = 1 =
LIGIALLCQAWYVLLIGLPIYLLVLFKRIRQEEQAMATLF
140 150 160

Based on this analysis, it was predicted that these proteins and their epitopes could be useful antigens for
vaccines or diagnostics.

Example 2848 

A DNA sequence (GASx1674R) was identified in S.pyogenes <SEQ ID 8195> which encodes the amino
acid sequence <SEQ ID 8196>. Analysis of this protein sequence reveals the following:

Possible site: 23

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3098 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2849 

A DNA sequence (GASx1677R) was identified in S.pyogenes <SEQ ID 8197> which encodes the amino
acid sequence <SEQ ID 8198>. Analysis of this protein sequence reveals the following:

Possible site: 33

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -8.86 Transmembrane 254 - 270 ( 248 - 280)
INTEGRAL Likelihood = -7.01 Transmembrane 303 - 319 ( 296 - 322)
INTEGRAL Likelihood = -2.39 Transmembrane 74 - 90 ( 74 - 91)
INTEGRAL Likelihood = -1.91 Transmembrane 201 - 217 ( 199 - 217)
INTEGRAL Likelihood = -1.91 Transmembrane 223 - 239 ( 220 - …)
INTEGRAL Likelihood = -1.65 Transmembrane 118 - 134 ( 115 - 135)
INTEGRAL Likelihood = -1.49 Transmembrane 56 - 72 ( 55 - 72)
INTEGRAL Likelihood = -0.32 Transmembrane 13 - 29 ( … - 30)

Final Results

bacterial membrane --- Certainty=0.4545 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:BAB05126 GB:AP001511 unknown conserved protein [Bacillus halodurans]
Identities = 249/534 (46%), Positives = 380/534 (70%)

Query: 12 QDIAFHFFGGLGLFLFSIKYMGDGLQQAAGDKLRYYIDKYTSNPFFGILVGIAMSALIQS 71 

Q + F FFGGLG+FLF IKYMGDGLQ+ AG++LR +DK+T+NP G+L GI ++ L+Q+ 
Sbjct: 6 QTLLFMFFGGLGIFLFGIKMGDGLQKVAGERLRDLLDKFTTKPLMGVLAGIVVTVLLQT 65 


Query: 72 SSGVTVITVGLVSAGLLNLRQAIGIVMGAN1GTTITSFLIGFKLGDYALPMIFIGAACLF 131 

S+G TV+T+GLV+AG + L+QAIG++MGANIGTT+T+F+IG K+ +YALP+I +GAA +F 
Sbjct: 66 STGTTVLTIGLVNAGFMTLKQAIGVIMGAKIGTTi/TAFIIGIKISEYALPIIAVGAALIF 125 

Query: 132 FTSNKKLNNFGRIIFGVGGIFFSLNLMGBAMDPLKSVSAFQNYIATLGDKPFQGVFIGTA 191

F NKK+NN G++IFG G +F+ LN MG+ ++PL+ + AF + ++ + P GV IGT
Sbjct: 126 FIKNKKVNNIGQVIFGFGTLFYGLNTMGEGLNPJjRELQAFADLTVSMSENPLLGVLIGTI 185

Query: 192 LTMLIQSSAAIIGILQGLFSGGLLTLQGAIPILLGSNIGTCITAVIAAIGSNIAAKRVAA 251
T +QSS+A IG+LQ L+ G + L A+P+L G NIGT ITAVLAAIG+++AAKR A

Sbjct: 186 FTAAVQSSSASIGLLQQLYDQGA^LFAM^PVLFGDNIGTTITAVLAAIGASVAAKRAAL 245

Query: 252 AHVLFNLIGTIIFMIILVPFTSLMLWLQSKLSLTPEMTIAFSHGSFNITNTILLIPFISL 311
HV+FNLIGTII +II++PFT + +L +L MTIAF+HG FN++NTI+ PFI +
Sbjct: 246 THVIFNLIGTIIVLIIIIPFTHFIAYLAEVFALNRPMTIAFAHGIFNVSNTIIQFPFIGI 305



Query: 312 IAMIVTRLIPGEDEVVKYEALYLDRLLITQAPSIALGNAHKELVHLASYAIQAFEASYSY 371 

IA+IVT+L+PG+D ++Y+A +LD + +P+IALG A +E++ +A ++ + Y 
Sbjct: 306 IAIIVTKLVPGDDFYIEYKAKHLDPRFVGSSPAIALGQAKQEVLRMAEFSEKGLLEVSKY 365 

Query: 372 IMTADGKFGEKTOEYEPAVDTIDEELTTYLVDISNEALSPSElsffiVLAGILDSSRDLERIG 431 

+ K E ++E A++ +D ++T YL+ IS+ +LS ++++ ++D+ RD+ERIG 

Sbjct: 366 MENGQKKHAEMAVQFEDAINNLDRKITEYLISISSRSLSAQDSKMHGMLMDTVRDIERIG 425 

Query: 432 DHSESLGILIEGIISKQIGFSISARQELTEMYQLTHCLTLDAIRAIVDSDTDLAQTIVTR 491 

DH E++ L + + ++ S A +L EM+ LTH +AI ++ D + A++++ + 
Sbjct: 426 DHIENIVELKDYQKANKVKISEKALHDLQEMFDLTHSTLTEAIMSLETGDLEAARSVIEK 485 

Query: 492 HKEIEEKERRLRKTHIKRLNCGECTAQAGINFIDIISHYTRITDHALNLAEKVL 545 

+ I++ ER+LRK HI R+N G CT AGI F+DI+S+ RI DH++N+AE V+ 
Sbjct: 486 EEHIDQMERKLRKQHIIRWEGNCTGAAGIVFVDIVSNLERIGDHSVNIAEAVI 539 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.



Example 2850

A DNA sequence (GASx1678R) was identified in S.pyogenes <SEQ ID 8199> which encodes the amino
acid sequence <SEQ ID 8200>. Analysis of this protein sequence reveals the following:

Possible site: 48

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2940 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2851 

A DNA sequence (GASx1685R) was identified in S.pyogenes <SEQ ID 8201> which encodes the amino
acid sequence <SEQ ID 8202>. Analysis of this protein sequence reveals the following:

Possible site: 22

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = … Transmembrane 13 - 23 ( 9 - 31)

Final Results

bacterial membrane --- Certainty=0.384S (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2852

A DNA sequence (GASx1695R) was identified in S.pyogenes <SEQ ID 8203> which encodes the amino
acid sequence <SEQ ID 8204>. Analysis of this protein sequence reveals the following:

Possible site: 15

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1357 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics.

Example 2853 

A DNA sequence (GASx1698) was identified in S.pyogenes <SEQ ID 8205> which encodes the amino acid
sequence <SEQ ID 8206>. Analysis of this protein sequence reveals the following: 

Possible site: 33 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1970 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2854 

A DNA sequence (GASx1713) was identified in S.pyogenes <SEQ ID 8207> which encodes the amino acid
sequence <SEQ ID 8208>. Analysis of this protein sequence reveals the following: 

Possible site: 44 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3092 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2855

A DNA sequence (GASx1737) was identified in S.pyogenes <SEQ ID 8209> which encodes the amino acid
sequence <SEQ ID 8210>. Analysis of this protein sequence reveals the following:

Possible site: 13 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1878 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2856 

A DNA sequence (GASx1748R) was identified in S.pyogenes <SEQ ID 8211> which encodes the amino
acid sequence <SEQ ID 8212>. Analysis of this protein sequence reveals the following: 

Possible site: 23 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2841 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2857 

A DNA sequence (GASx1750R) was identified in S.pyogenes <SEQ ID 8213> which encodes the amino
acid sequence <SEQ ID 8214>. Analysis of this protein sequence reveals the following: 

Possible site: 59 



Final Results

bacterial membrane --- Certainty=0.1489 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2858 

A DNA sequence (GASx1754) was identified in S.pyogenes <SEQ ID 8215> which encodes the amino acid
sequence <SEQ ID 8216>. Analysis of this protein sequence reveals the following: 

Possible site: 44 

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2859 

A DNA sequence (GASx1759) was identified in S.pyogenes <SEQ ID 8217> which encodes the amino acid
sequence <SEQ ID 8218>. Analysis of this protein sequence reveals the following: 

Possible site: 36 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1534 (Affirmative) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2860 

A DNA sequence (GASx1764R) was identified in S.pyogenes <SEQ ID 8219> which encodes the amino
acid sequence <SEQ ID 8220>. Analysis of this protein sequence reveals the following: 

Possible site: 29 

>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood = -6.74 Transmembrane 90 - 106 ( 87 - 121) 

INTEGRAL Likelihood = -4.57 Transmembrane 210 - 226 ( 205 - 229) 

INTEGRAL Likelihood = -4.19 Transmembrane 43 - 59 ( 42 - 62) 

INTEGRAL Likelihood = -3.77 Transmembrane 137 - 153 ( 137 - 155) 
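
As an illustrative aid, the INTEGRAL lines above can be parsed to list the predicted transmembrane segments in sequence order. This is a minimal Python sketch, assuming the line layout shown in this example (likelihood, core segment, and the wider range in parentheses); `INTEGRAL_RE` and `parse_integral` are hypothetical names introduced here.

```python
import re

# Hypothetical pattern for one "INTEGRAL" line as printed above.
INTEGRAL_RE = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(?P<score>-?\d+\.\d+)\s+Transmembrane\s+"
    r"(?P<start>\d+)\s*-\s*(?P<end>\d+)\s*"
    r"\(\s*(?P<lo>\d+)\s*-\s*(?P<hi>\d+)\s*\)"
)


def parse_integral(line):
    """Return (score, core_start, core_end, range_lo, range_hi) or None."""
    m = INTEGRAL_RE.search(line)
    if m is None:
        return None
    return (float(m["score"]), int(m["start"]), int(m["end"]),
            int(m["lo"]), int(m["hi"]))


# The four segments reported for the protein of Example 2860.
lines = [
    "INTEGRAL Likelihood = -6.74 Transmembrane 90 - 106 ( 87 - 121)",
    "INTEGRAL Likelihood = -4.57 Transmembrane 210 - 226 ( 205 - 229)",
    "INTEGRAL Likelihood = -4.19 Transmembrane 43 - 59 ( 42 - 62)",
    "INTEGRAL Likelihood = -3.77 Transmembrane 137 - 153 ( 137 - 155)",
]
# Sort by the start of the core segment rather than by likelihood.
segments = sorted((parse_integral(line) for line in lines), key=lambda s: s[1])
core_lengths = [end - start + 1 for _, start, end, _, _ in segments]
print(segments)
print(core_lengths)
```

In this example every core segment spans 17 residues, which the `core_lengths` list makes easy to check.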

Final Results 

bacterial membrane --- Certainty=0.3697 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2861 

A DNA sequence (GASx1768R) was identified in S.pyogenes <SEQ ID 8221> which encodes the amino
acid sequence <SEQ ID 8222>. Analysis of this protein sequence reveals the following: 

Possible site: 17 

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood =-12.37 Transmembrane 26 - 42 ( 17 -

INTEGRAL Likelihood = -7.54 Transmembrane 53 - 69 ( 46 -

INTEGRAL Likelihood = -3.29 Transmembrane 209 - 225 ( 209 - 225)

INTEGRAL Likelihood = -2.13 Transmembrane 82 - 98 ( 82 -

INTEGRAL Likelihood = -1.65 Transmembrane 9 - 25 ( 9 - 25)

INTEGRAL Likelihood = -0.85 Transmembrane 117 - 133 ( 117 - 134)

Final Results 

bacterial membrane --- Certainty=0.5946 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAB84959 GB:AE000829 conserved protein [Methanobacterium 
thermoautotrophicum] 
Identities = 54/192 (28%) , Positives = 90/192 (46%) , Gaps = 6/192 (3%) 



WO 02/34771 



PCT/GB01/04789 



-2843- 







Query: TKLLLLVIjAI^CFFFRVDGFLE?IIVIFLLLLLSALNKKK]^--FKLAVVYLLMIGLSVI
Sbjct: 32 SKLTVVVSATLLSTFISDLTLLIIMGVIFTMiIAHSGSLRFflAPFLSPIILFWLVSLft.il 91

Query: 65 PLSIFPSYIiDHLLSFVSIAGRLVFPSLLAGLITIKTTTIYELVHGLRKWRFPEWLLTIA 124
+ S H + F+S+ F AGL TT +L LR R P + TL
Sbjct: 92 MVL SGNPHTMGFLSLFFARFFIISAAGLSFAFTTEPQKLAESLRSVRIPGEIVFTLT 148

Query: 125 VMCRFIPMIRQECCVIHRSLKIRGIILTKWSILIRPKQYLEYLMVPLLLSLIRSSQELTI 184
V R+IP + E I SLK+R L+ SI+ RP L++P+++ ++ S E+ I
Sbjct: 149 VALRYIPALAVEASSIWDSLKLR-TSLSGSSIIRRPSLLYRGLIIPMIIRTVKISDEVAI 207

Query: 185 ASLTKGL&VNKG 196
A+ T+G +G
Sbjct: 208 AAETRGFNPREG 219





Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2862 

A DNA sequence (GASx1769R) was identified in S.pyogenes <SEQ ID 8223> which encodes the amino
acid sequence <SEQ ID 8224>. Analysis of this protein sequence reveals the following: 

Possible site: 16 



>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood = -7.32 Transmembrane 164 - 180 ( 158 - 186)
INTEGRAL Likelihood = -4.67 Transmembrane 85 - 101 ( 84 - 105)
INTEGRAL Likelihood = -3.03 Transmembrane 42 - 58 ( 42 - 61)
INTEGRAL Likelihood = -2.76 Transmembrane 118 - 134 ( 117 - 134)
INTEGRAL Likelihood = -2. Transmembrane 64 - 80 ( 64 - 82)
INTEGRAL Likelihood = -1. Transmembrane 18 - 34 ( 17 - 34)



Final Results 

bacterial membrane --- Certainty=0.3930 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.



Example 2863 

A DNA sequence (GASx1776R) was identified in S.pyogenes <SEQ ID 8225> which encodes the amino
acid sequence <SEQ ID 8226>. Analysis of this protein sequence reveals the following: 

Possible site: 21 

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -6.37 Transmembrane 4 - 20 ( 1 - 22)
INTEGRAL Likelihood = -0.43 Transmembrane 261 - 277 ( 261 - 278)

Final Results

bacterial membrane --- Certainty=0.3548 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.




The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2864 

A DNA sequence (GASx1777R) was identified in S.pyogenes <SEQ ID 8227> which encodes the amino
acid sequence <SEQ ID 8228>. Analysis of this protein sequence reveals the following: 

Possible site: 24 

N-terminal signal sequence

1217 - 1233 ( 1215 - 1235)



Final Results 

bacterial membrane --- Certainty=0.4270 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAF53254 GB:AE003639 CG16974 gene product [Drosophila 
melanogaster] 

Identities = 84/238 (35%) , Positives = 133/238 (55%) , Gaps = 10/238 (4%) 

Query: 516 LRLDHYELTDISLL--KHAKNITELHLDGNQITEIPKELFSQMKQLRFLNLRSNHLTYLD 573

L + L++ SLL ++ K + ELHLD +++T +P+ ++ +LR LNL N LT L
Sbjct: 232 LEMSGNRLSNCSLLNLQWIKQLQELHLDRSELTYLPQRFLGELSELRMLNLSQNLLTELP 291

Query: 574 KDTFKSNAQLRELYLSSNFIHSLEGGLFQSLHHLEQLDLSKNRIGRLCDNPFEGLSRLTS 633

+D F +L LYLS N + L LFQ+ L+ LDLS NR+ DN F +L
Sbjct: 292 RDIFVGALKLERLYLSGNRLSVLPFMLFQTAADLQVLDLSDNRLLSFPDNFFARNGQLRQ 351

Query: 634 LGFAENSLEEIPEKALEPLTSLNFIDLSQNNLALLP-KTIEKLRALSTIVASRNHITRID 692

L N L+ I + +L L L +DLSQN+L+++ K E L L + S N++T +
Sbjct: 352 LHLQKNQLKSIGKHSLYSLRELRQLDLSQNSLSVIDRKAFESLDHLLALNVSGNNLTLLS 411

Query: 693 NISFKNLPKLSVLDLSTNEISNLPNGIFKQNNQL TKLDFFNNLLTQVEESV 743

+I F++L L LDLS N+ LP+G+F++ L T ++ F+N +++ +ES+

Sbjct: 412 SIIFQSLHALRQLDLSRNQFKQLPSGLFQRQRSLVLLRIDETPIEQFSNWISRYDESL 469

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2865

A DNA sequence (GASx1778R) was identified in S.pyogenes <SEQ ID 8229> which encodes the amino
acid sequence <SEQ ID 8230>. Analysis of this protein sequence reveals the following: 

Possible site: 39 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1067 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2866 

A DNA sequence (GASx1779) was identified in S.pyogenes <SEQ ID 8231> which encodes the amino acid
sequence <SEQ ID 8232>. Analysis of this protein sequence reveals the following: 

Possible site: 17 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1885 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2867 

A DNA sequence (GASx1786R) was identified in S.pyogenes <SEQ ID 8233> which encodes the amino
acid sequence <SEQ ID 8234>. Analysis of this protein sequence reveals the following: 

Possible site: 19 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0512 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2868 

A DNA sequence (GASx1790) was identified in S.pyogenes <SEQ ID 8235> which encodes the amino acid
sequence <SEQ ID 8236>. Analysis of this protein sequence reveals the following:

Possible site: 13

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.


Example 2869 

A DNA sequence (GASx1791R) was identified in S.pyogenes <SEQ ID 8237> which encodes the amino
acid sequence <SEQ ID 8238>. Analysis of this protein sequence reveals the following:

Possible site: 43 



Final Results

bacterial membrane --- Certainty=0.1362 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

A related sequence was also identified in GAS <SEQ ID 9155> which encodes the amino acid sequence 
<SEQ ID 9156>. Analysis of this protein sequence reveals the following: 

a signal seq. 

Final Results 

bacterial outside --- Certainty=0.300 (Affirmative) < succ>

bacterial membrane --- Certainty=0.000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAA24923 GB:L06331 endoglycosidase [Chryseobacterium 
meningosepticum] 

Identities = 105/322 (32%) , Positives = 153/322 (46%) , Gaps = 53/322 (16%) 

Query: 106 ADKQAQELAKMKIPEKIPMKPLHGSLYGGYFRTWHDKTSDPTEKDKVNSMGELPKEVDLA 165
A K ++ + + IK + GY+RTW D T + EM LP +D+
Sbjct: 37 AQKSGVTVSAVNLSNLIAYKNSDHQISAGYYRTWRDSA TASGNLPSMRWLPDSLDMV 93

Sbjct: 165 94
226
Sbjct:
Query: 286
Sbjct: 196
344
Sbjct:
404
Sbjct: 287

--TTTGGQDS 143
--SSPSGATLTKFVAATKALSKYFGP 195
++NP + AP N + +Q YG R +
--NQNPINFFIQTAPRYNYVFLQAYG RSTTOL 237
Y+ YI +Q++ GFSFYEEN GN W D+ +
TTVSGLYAPYISMKQFLPGFSFYEENGYPGNYWNDVRYPQ--

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2870 

A DNA sequence (GASx1803) was identified in S.pyogenes <SEQ ID 8239> which encodes the amino acid
sequence <SEQ ID 8240>. Analysis of this protein sequence reveals the following:

Possible site: 20 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2099 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2871 

A DNA sequence (GASx1806R) was identified in S.pyogenes <SEQ ID 8241> which encodes the amino
acid sequence <SEQ ID 8242>. Analysis of this protein sequence reveals the following:

Possible site: 54

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2706 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB16126 GB:Z99124 ribosomal protein S18 [Bacillus subtilis] 
Identities = 51/77 (66%) , Positives = 63/77 (81%) 

Query: 1 MAQQRRGGFKRRKKVDFIAANKIEYVDYKDTELLSRFVSERGKILPRRVTGTSAKNQRKV 60

MA RRGG +R+KV + +N I ++DYKD +LL +FVSERGKILPRRVTGT+AK QRK+
Sbjct: 3 MAGGRRGGRAKRRKVCYFTSNGITHIDYKDVDLLKKFVSERGKILPRRVTGTNAKYQRKL 62

Query: 61 TTAIKRARVMALMPYVN 77

T AIKRAR MAL+PYV+
Sbjct: 63 TAAIKRARQMALLPYVS 79
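
For illustration, the "Identities" and "Positives" counts reported for an alignment can be checked against its middle line, in which a letter marks an identical residue, "+" marks a conservative substitution, and a space marks neither. The following minimal Python sketch assumes the middle line still has its original spacing (scanned text often loses it); `score_midline` and `percent` are hypothetical helper names. The reported percentages in this example (51/77 as 66%, 63/77 as 81%) are consistent with truncation rather than rounding, which is what `percent` assumes.

```python
def score_midline(midline):
    """Count identities (letters) and positives (letters plus '+')."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


def percent(count, length):
    """Integer percentage, truncated rather than rounded (an assumption)."""
    return 100 * count // length


# Second block of the ribosomal protein S18 alignment above.
query = "TTAIKRARVMALMPYVN"
midline = "T AIKRAR MAL+PYV+"
sbjct = "TAAIKRARQMALLPYVS"
assert len(query) == len(midline) == len(sbjct)

block_identities, block_positives = score_midline(midline)
print(block_identities, block_positives)
print(percent(51, 77), percent(63, 77))
```

Summing the per-block counts over every block of an alignment should reproduce the totals given in its header line, provided the middle lines are intact.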

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2872 

A DNA sequence (GASx1809R) was identified in S.pyogenes <SEQ ID 8243> which encodes the amino
acid sequence <SEQ ID 8244>. Analysis of this protein sequence reveals the following: 

Possible site: 60

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -7.59 Transmembrane 70 - 86 ( 66 - 92) 

INTEGRAL Likelihood = -6.42 Transmembrane 13 - 29 ( 8 - 33) 

INTEGRAL Likelihood = -5.68 Transmembrane 48 - 64 ( 43 - 69) 

Final Results 

bacterial membrane --- Certainty=0.4036 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2873 

A DNA sequence (GASx1813R) was identified in S.pyogenes <SEQ ID 8245> which encodes the amino
acid sequence <SEQ ID 8246>. Analysis of this protein sequence reveals the following: 

Possible site: 56 

>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood =-10.51 Transmembrane 127 - 143 ( 113 - 147) 
INTEGRAL Likelihood =-10.46 Transmembrane 151 - 167 ( 149 - 167) 
INTEGRAL Likelihood = -4.41 Transmembrane 59 - 75 ( 57 - 77) 

Final Results 

bacterial membrane --- Certainty=0.5203 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAB98363 GB:U67490 lipoprotein B (lppB) [Methanococcus 
jannaschii] 

Identities = 43/143 (30%) , Positives = 68/143 (47%) , Gaps = 7/143 (4%) 



Query: 25 LLNVLLKIITGVMY--ILYPSFLIFTLWCGMTFQLWLRLLIIPAVGFIALSYIRKRFDFP 82

+ + ++ II+ Y I S +IF + +L L + + F +L Y+ P
Sbjct: 181 IFDAIMPIISKTAYPLIAITSLIIFIKNRKFGMKLIFALFLAFMIAF-SLKYLVNE---P 236

Query: 83 RPYEKMNIKPLIDKDTKGRSMPSRHVFSATMISMCLLRYYVYFGIVCLILSALLAICRVI 142

RPY + L+ + S PS H A ++ LL Y GI+ L + ++A RV
Sbjct: 237 RPYLVLDNVHLLCNEGNEPSFPSGHTTLAFTLATSLLFYSKKLGILFLSWAIIVAYSRVY 296

Query: 143 AGIHYPKDVIVGYLIGLMLGLCL 165

G+HYP DV+ G +IG+ G CL
Sbjct: 297 VGVHYPLDVLAGMIIGIFCG-CL 318



A related GBS gene <SEQ ID 9011> and protein <SEQ ID 9012> were also identified. Analysis of this
protein sequence reveals the following:

Lipop: Possible site: -1 Crend: 9
McG: Discrim Score: 3.19
GvH: Signal Score (-7.5): -2.18
Possible site: 55
>>> Seems to have a cleavable N-term signal seq.
ALOM program count: 3 value: -11.78 threshold: 0.0
INTEGRAL Likelihood =-11.78 Transmembrane - 142 ( 112 - 147)
INTEGRAL Likelihood =-11.30 Transmembrane - 166 ( 147 - 166)
INTEGRAL Likelihood = -4.41 Transmembrane - 74 ( 56 - 75)
PERIPHERAL Likelihood = 3.29
modified ALOM score: 2.86

Reasoning Step: 3

Final Results

bacterial membrane --- Certainty=0.5713 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>


The protein has homology with the following sequences in the databases: 

ORF01020(472 - 792 of 1098)

EGAD|44548|MJ0374(213 - 318 of 330) conserved hypothetical protein {Methanococcus
jannaschii} OMNI|MJ0374 conserved hypothetical protein SP|Q57819|Y374_METJA HYPOTHETICAL
PROTEIN MJ0374. GP|1591081|gb|AAB98363.1||U67490 lipoprotein B (lppB) {Methanococcus
jannaschii} PIR|F64346|F64346 hypothetical protein MJ0374 - Methanococcus jannaschii

%Match = 6.8

%Identity = 30.8 %Similarity = 53.3

Matches = 33 Mismatches = 49 Conservative Sub.s = 24

222 252 282 312 342 372 402 432

EGOTKYLRRNKOTKHFAYAPQNAGGSGATIVTLG*IKESYEQFYAKLSQPFRKSPQLIILIMFLLKIVTGMMYILYPSFL

VIAWLSGIFEMHKLLFTVGTIIGRLPRFLAVAYFGDVLGNINRLSDINIYLFYLINSHYNYIFDAIMPIISKTAYPLIAI
130 140 150 160 170 180 190

462 492 522 552 582 612 642 672

IFTLWQGMTFQLWLRLLIIPAVGFIALSYIRKRLDFPRPYEKWNIKPLIYKDTEGRSMPSRHVFSATMISMCLLRYYVYF

::|: : |: :: :: |||| = |: : I I I I I " III

TSLIIFIKNRKFGMKLIFALFLAFMIAFSLKYLVNEPRPYLVLDNVHLLCNEGNEPSFPSGHTTLAFTLATSLLFYSKKL
210 220 230 240 250 260 270

702 732 762 792 822 852 882 912

GIVCLILSVLLAICRVIAGIHYPKDVIVGYLIGLILGLCLFI*RVRSK*FQKQLDSCTIGLSLR*NGEKRWH*K*QMLHL
Ih I ::::| II hill 11= I =11= I II
GILFLSWAIIVAYSRVYVGVHYPLDVLAGMIIGIFCG-CLTRIDIYKLIDNI
290 300 310 320 330

Based on this analysis, it was predicted that these proteins and their epitopes could be useful antigens for 
vaccines or diagnostics. 

Example 2874 

A DNA sequence (GASx1815R) was identified in S.pyogenes <SEQ ID 8247> which encodes the amino
acid sequence <SEQ ID 8248>. Analysis of this protein sequence reveals the following: 

Possible site: 15 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0888 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2875 

A DNA sequence (GASx1825R) was identified in S.pyogenes <SEQ ID 8249> which encodes the amino
acid sequence <SEQ ID 8250>. Analysis of this protein sequence reveals the following:

Possible site: 30 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -0.16 Transmembrane 7 - 23 ( 7 - 23)

Final Results

bacterial membrane --- Certainty=0.1055 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2876

A DNA sequence (GASx1832) was identified in S.pyogenes <SEQ ID 8251> which encodes the amino acid
sequence <SEQ ID 8252>. Analysis of this protein sequence reveals the following:

Possible site: 26 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0918 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2877 

A DNA sequence (GASx1836R) was identified in S.pyogenes <SEQ ID 8253> which encodes the amino
acid sequence <SEQ ID 8254>. Analysis of this protein sequence reveals the following: 

Possible site: 22 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4084 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2878 

A DNA sequence (GASx1864R) was identified in S.pyogenes <SEQ ID 8255> which encodes the amino
acid sequence <SEQ ID 8256>. Analysis of this protein sequence reveals the following: 

Possible site: 21 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.5280 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC36810 GB:L12244 ribosomal protein L28 [Bacillus subtilis] 
Identities = 45/62 (72%) , Positives = 52/62 (83%) 

Query: 1 MAKVCYFTGRKWSGNlffiSHAMNQTKRTOKPNLQKVTILVIDGKPKKVWASARALKSGKVE 60

MA+ C TG+KT +GNNRSHAMN +KRT NLQKV ILV+GKPKKV+ SARALKSGKVE
Sbjct: 1 MARKCVITGKKTTAGNNRSE&MNASKRTWGANLQKVRILVNGI^ 60

Query: 61 RI 62

R+ 

Sbjct: 61 RV 62 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2879 

A DNA sequence (GASx1869) was identified in S.pyogenes <SEQ ID 8257> which encodes the amino acid
sequence <SEQ ID 8258>. Analysis of this protein sequence reveals the following: 

Possible site: 19 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1858 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2880

A DNA sequence (GASx1881) was identified in S.pyogenes <SEQ ID 8259> which encodes the amino acid
sequence <SEQ ID 8260>. Analysis of this protein sequence reveals the following: 

Possible site: 29 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2752 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

RGD motif 135-138

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAF04356 GB:AF177167 type 1C restriction subunit [Streptococcus thermophilus] 
Identities = 358/1047 (34%), Positives = 571/1047 (54%), Gaps = 91/1047 (8%) 

Query: 7 TELELEKELIHLLETGESQWTYRKELKTEDALM)NFFKILAQNNTQYLNEEPLTASEKEQ 66 
+E +E + I +h E+QWTYR +LK+E+ALW NF L + N L E+PLT E +Q

Sbjct: 4 SEQMIENQFIQILSEKENQWTYRPDLKSEEALWQNFRSHLNRINLAVLGEQPLTDKEFKQ 63 



Query: 67 IKNQLNFVNY- -YEAAKWIiAGEMGIAKVQVQREDAKLGTIRLEWKADNVAGGTSVYEIA 124 
+K + + + + A++WL GENG+A++ ++RED K + LE + +++GGTS YE+ 
Sbjct: 64 VKVEFSRLTGTPFLASQWLRGENGVAQ1LLEREDGK--RVTLEAFRNKDISGGTSSYEW 121



Query: 125 NQVAFSGSRDRRGDVTLLINGLPMIQIELKSQNIIQ- --CIEAFNQVKKYDKEGQFRGIFST 182 

+QV SR RGDV+LLINGLP+I IELK ++ + ++A+ Q+++Y ++G F+GI++T 
Sbjct: 122 HQWPDSSRVDRGDVSLLINGLPIIHIELKKESAKDGFKQAYYQIQRYAEDGFFKGIYAT 181 

Query: 183 LQMFWSNKTDTRYIAAAKENKLNP NFLTQWVDQNNKPQKDLFAFAKEVLS I PRA 237 

Q+ V+SNK DTRY A E+ FL W ++M+ DLF F + VL IP A 

Sbjct: 182 TQIMVISNKVDTRYFARPSEDTAEAYARMKKFLFN^TEDNQTVSDLFDFTRTVLRIPDA 241 

Query: 238 HQMVMTYSVIDDDKKA LILLRPYQIHAIEAVAEASRHRKSGYIWHTTGSGKTLTSYK 294 

H+++ Y+++ DD+K L+ LRPYQIHAI + + + + G+IWH TGSGKT+TS+ 
Sbjct: 242 HELISQYTILVDDQKNQKFLMALRPYQIHAIRKIRQKAAQHEGGFIWHATGSGKTITSFV 301 



Query: 295 VARNILQI P -AVEKSI FVIDRKDLDNQTASAFQSYA- - - -QNDIFD- -VDETEDT 342 

+ + Q V++++ V+DR DLD QT F +A +N + + + ++ 

Sbjct: 302 ATKLIAQNAIGVDRTWIVVDRTDLDAQTQDEFTKFASEYHTGQTTENSVANTLIVGIKNQ 361 

Query: 343 RQLIKNLESS--DPJ^VVWTIQKLNAMISQMESYDTPKFKKLKERIAHLNWFVVDECHR 400 

+QL +NL SS + ++VTTIQKL+A + + K E+L ++VF+VDE HR 

Sbjct: 362 KQLAQNLLSSKITONTILVTTIQKIiSAAMRSAQQESEEKGSNQFEKLRQEHIVFIVDEAHR 421 



Query: 401 A VTPERQRYLTNTFRNSRWYGFTGT P I FVENKRAQLGDLAQTTEQQYGKCLHQYTVKEAI 460 

AV+ E + + NS W+G TGTPIF ENK+ + G A+TT QQYG LH YT+K A+ 

Sbjct: 422 AVSDEEMKRIKKILPNSTWFGLTGTPIFEENKKQENGTFARTTSQQYGPLLHSYTIKNAM 481 



Sbjct: 482 DDGAVLGFQVEYHSLISEEDQSVIVTQLMKGK1PDDALQQEKLLPTELYETDEHIRTMLQ 541 

Query: 501 SI IKQSR- - KKLGFNNGIGQTFEGLLTVKSIARAOAYYDLMKKVKAGETDLVI SKKVKEK 558 

I N+ KK NG T +LT SIA+A+ Y ++K++K T L+ ++ E+ 
Sbjct: 542 KIFNRRSVVKKFKVKNGF-PTMSAILTTHSIAQAKHIYRILKEMKDNGT-LLHGRQFDER 599 



Query: 559 L PDFPKVAITYSITEKDNASISRQDHlTKmEDYiraLFGTNFTIDNLQGYNRDIiND 614 

DFP+VAIT+S + + D++ + +++Y F + D + YN+++N 

Sbjct: 600 HQLIDKDFPRVAITFSTNPDQLEKNEQDDEI.VEIMKEYEKQFDASPYQDE-KLYNQNINK 658 

Query: 615 RLARKKDKFKDRHEQLDLVIVVDRLLTGFDAPCIjSTIFIDRQPMKPQHIIQAFSRraRIF 674
I- LD VIWDRLLTGFD+P + T++IDR+ M Q ++QAFSRTTJRI+

Sbjct: 777 SEADLSDNPNDLKTMIAQVSAYQKLEKSYKALRS YDQYEEDFEE - - FSEV- - -VEQLPQY 831
+G +N+ ED



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2881 

A DNA sequence (GASx1882) was identified in S.pyogenes <SEQ ID 8261> which encodes the amino acid
sequence <SEQ ID 8262>. Analysis of this protein sequence reveals the following:

Possible site: 39

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3653 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 3 KSKQPQYRFDGFEGEWEEKELGDIVQITMGQSPSSQNYTTRPSDYILVQGNADIKNGYVF 62 

K K P+ RF GF EWE ++LGD V+I MGQSP+S+HYT +P+DYILVQGNAD+KNG V 
Sbjct: 13 KKKVPELRFKGFTDEWELRKLGDEVRIW.GQSPHSa^TDDPNDYILVQGNADMKNGRVL 72 

Query: 63 PRVWTTQITKQADKGDI1LSVRAPVGDVGKTNYHVIIGRGVAAIKGNEFIFQ1LKYLKEI 122 

PRVWTTQ+TKQA+K D+ILSVRAPVGD+GKT Y V+IGRGVAA1KGNEFIFQ L +K 
Sbjct: 73 PRVlOTTQVTKQAEKDDLlLSvRAPVGDIGRrAYDWrGRGVAAIKGNEFIFQNLGKMKSD 132 

Query: 123 GYWKRISTGSTFDSISSSDIKYAKIQIPSLPEQEAIGELFQM\'DQLIQLQDQKLATLKEQ 182 

GYW R STGSTF+SI+S+DIK A I +P++ EQ+ IG P+ +D I L +KL LKEQ 
Sbjct: 133 GYWTRYSTGSTFESINSTDIKEAIISVEAIEEQDKIGSFFKQLDNTIALHQRKLDLLKEQ 192 

Query: 183 KQTFLRKMFPAQGQKVPE IRLQGFKGEWEEKKLRE VSTHRSGTAIEKYFDSEGEFKVI S IG 243 

K+ FL+KMFP G KVPE+R GF +WEE+KL +++ +G G++ + G 

Sbjct: 193 KKGFLQKMFPraGAKVPELRFAGFADI^ERKLGDITKISTGKLDANAMVENGKYDFYTSG 253

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2882 

A DNA sequence (GASx1883) was identified in S.pyogenes <SEQ ID 8263> which encodes the amino acid
sequence <SEQ ID 8264>. Analysis of this protein sequence reveals the following:

Possible site: 61

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4318 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAF04357 GB:AF177167 type IC modification subunit [Streptococcus thermophilus] 
Identities = 293/523 (56%) , Positives = 377/523 (72%) , Gaps = 6/523 (1%) 

Query: 56 I FEDAYQDEGLKDDLISVVTGDLGYFIEPTLTFEKLIQDVYHNTFQLESLAQGFRDI 122 

+ F + Y+++ KDDLI + GYFI+P F + F L h GF ++ 

Sbjct: 63 LYAGFMEWYEED--KDDLIENIQPRQGYFIQPDRLFYHYRIKADNYEFNLTDLQAGFNEL 120 

Query: 123 EQSGEDFENLFEDIDLYSKKLGSTPQKQNQTISNVMKTLNEIDFEAVDGDTLGDAYEYLI 182 

E+ GE+F LF DIDL S KLGS Q++N TI+ V++ L+EID +GD +GDAYEYLI 
Sbjct: 121 ERQGEEFSGLFSDIDLNSTKLGSNAQQRNVTITEVLRALDEIDLFEHNGDVIGDAYEYLI 180 

Query: 183 GEFASESGKKAGEFYTPQAVSHLMTQIVFLGREDQKGMTLYDPAMGSGSLLLNAKKYSNQ 242 

G FA+ +GKKAGEFYTPQAVS +M++I +G+E + +YDPAMGSGSL+LN ++Y 
Sbjct: 181 GMFAAGAGKKAGEFYTPQAVSRIMSEITSIGQESRVPFHIYDPAMGSGSLMLNIRRYLIH 240 

Query: 243 SDTVSYYGQEINTSTYNLAR>1ISIMMLHGVAIENQHLSNADTLDADWPTDEPINFDGVLMNP 302 

+ V Y+GQE+NT+T4NIiARMN++LHGV E +L+N DTLDADWP++EP FD V+MNP 
Sbjct: 241 r^QWYHGQELOTTTFNLARMffllilLHGVDKER 300 

Query: 303 EYSLKWSATAGFLTDPRFSSYGvLAPKSKADFAFLLHGFYHLKNTGTMAIVLPHGVLFRG 362 

PYS KWSA FL+DPRF +G LAPKSKADFAFLLKGFYHLK +GTM IVLPHGVLFRG 
Sbjct: 301 PYSAKWSAADKFLSDPRFERFGKLAPKSKADFAFLLHGFYHLKESGTMGIVLPHGVLFRG 360 

Query: 363 AAEGKIRQKLLEQGA1DTIIGLPSNIFYNTSIPTTIIILKKNRTNKDVFFIDASKEFDKG 422

AEG IRQ LLE GAID +IGLP+NIF+ TSIPTT+IILKKNR+ +DV FIDAS++F+K 
Sbjct: 361 GAEGTIRQALLEMGAIDAVIGLPANIFFGTSIPTTVIILKKNRSRRDVLFIDASQDFEKQ 420 

Query: 423 KNQNTMTDNHIKKILDAYKSRDNSDKFSYLASFDEIIENDYNU^IPRYVDTFEEVPVKPL 482 

KNQN + D HI KI+ YK R++ ++++++ASFDEI END+NLNI PRYVDTFEE L 
Sbjct: 421 KNQNVLLDEHIDKIVSTYKKREDIERYAIT/ASFDEIQENDFNIjNIPRYVDTFEEEEPVDL 480 

Query: 483 PELAKQLSDIDQEIAKTNAKLDQLMKQLVGTTKEAQDELDTFR 525 

E+ L I++E+ + L L4 ++E Q +++ R 

Sbjct: 481 VE VNTNLLKINEELVQQEQTLLSLINDF - SESEENQAMIESMR 522 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2883

A DNA sequence (GASx1886R) was identified in S.pyogenes <SEQ ID 8265> which encodes the amino
acid sequence <SEQ ID 8266>. Analysis of this protein sequence reveals the following:

Possible site: 59

>>> Seems to have an uncleavable N-term signal seq

[Seven INTEGRAL transmembrane predictions; likelihood values illegible. Legible ranges: Transmembrane 95 - 111; ( 147 - 173); ( 179 - 205); ( 128 - 152)]

Final Results

bacterial membrane --- Certainty=0.4270 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2884 

A DNA sequence (GASx1890R) was identified in S.pyogenes <SEQ ID 8267> which encodes the amino
acid sequence <SEQ ID 8268>. Analysis of this protein sequence reveals the following:

Possible site: 58

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4757 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

RGD motif 339-341 

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 46 KLFYSEFENQLFETIMFLSMKTLVLDINHFSKEIENK SEAYEQYIQQ-IREENGIN 100 

K F L + ++ L+ KTLVLD++ F K K S+ + Y+++ + I 

Sbjct: 135 KEFIINLLENLTQELIHLTSKTLVLDLHTFKKNEPLKGNDSSKRFIYYLKKRFNSKKDII 194 

Query: 101 HFFDRYPYLLKQINKEVGLIEESYSLLFDRFLEDLSEIKSCFNI - SEPLSNVAFSLGDSH 159 

F+ YP L++ + ++ + R EDL I++CFNI S L++++ S GDSH 

Sbjct: 195 AFYTCYPELMRITVVRMRYFLDNTKQMLIRVTEDLPSIQNCFNIQSSELNSISESQGDSH 254 

Query: 160 SKKQTWKIAFKE - KSVYYKPKSYHSHS I LLELTSLLKSSNI PS FSLPKSL VKADYCWQL 218 

S+ +TV + F + K + YKPK +S + L+ L +K++Y++ 
Sbjct: 255 SRGKTVSTLTFSDGKKIVYKPK- INSENKLRDFFEFLNKELEADIYIVKKVTRNTYFYEE 313 

Query: 219 GVAYTSSNK-DEVAKIYFKYGVLAAFSEIFSITDLHMENVIVSGGDLYLIDVETFFQRKL 277 

+ N +EV K Y +YG L + +F++TDLH EN+I G +ID ETFFQ+ + 

Sbjct: 314 YIDNIEINNIEEVKKYYERYGKLIGIAFLFNVTD^HYENIIAHGEYPVIIDNETFFQQNI 373

Query: 278 NVQNQNFEGITVDTYQRIYETSLSNGLFP- - -VQFEKNSAPNVSGISRKGGKRQKGKYEL 334

+4- N TVD + ++ + GL P ++ + +S +S K Q +++ 

Sbjct: 374 PIEFGN- - SATVDAKYKYLDS IMVTGLVPYLflMKDKSDSKDEGVMLSALNFKEQSVPFKI 431 

Query: 335 I NKmGDLKlVKVDYFQEDRFNIPTLKGICVVEPLDYAMEIISGFRECYIFLLSQRSK 391 

+ N +++ + + NP+N + ++ Y I++G + + + K 

Sbjct: 432 LKIKNTFTDEMRFEYQTHIMDTAKOTPIMNNEKISFISYEKYIVTGMKSILMKAKDSKKK 491 

Query: 392 IKEIV-EGFPELKSRVPFRNTSDYGKFLQASTNPKYLFS EKKRKNLFSILYETKHI 446 

1+ LRRTYL+S+P + EK N+++ Y+ K + 

Sbjct: 492 IIAYIIOTLQNLIVRlWIRPTQRYAaMLEFSYHPNCFSI^IEREKVLHISn^YPYKNK^ 551 

Query: 447 EHFIVDNEIKDLMNGD1P-YFSMDTRGNVYNSVGTLIGNLGDTTSL FDSITILNDER 502 

H+ E DL++GDIP +++ ++ ++ S G L+ + ++L + I L DE 
Sbjct: 552 VHY EFSDLIDGDIPIFYNNISKTSLIASDGCLVEDFYQESALNRCLNKINDLCDED 607 

Query: 503 LKFTCELLEIVL 514 

+ LEI L 

Sbjct: 608 ISIQTVWLEIAL 619 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2885 

A DNA sequence (GASx1891R) was identified in S.pyogenes <SEQ ID 8269> which encodes the amino
acid sequence <SEQ ID 8270>. Analysis of this protein sequence reveals the following:

Possible site: 40

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3487 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MSFMKNSKDILTNAIEEVSEKELMEVAGGKKGSGWFATITDDCPNSVFvTCC 51 

M+ MKNSKDIL NAIEEVSEKELMEVAGGK+GSGW ATITDDCPNSVFVCC 
Sbjct: 1 MNAMKNSKDILNNAIEEVSEKELMEVAGGKRGSGWIATITDDCPNSVFVCC 51 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2886 

A DNA sequence (GASx1901R) was identified in S.pyogenes <SEQ ID 8271> which encodes the amino
acid sequence <SEQ ID 8272>. Analysis of this protein sequence reveals the following:

Possible site: 13

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = ... Transmembrane ... - 19 ( 1 - 20)

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2887 

A DNA sequence (GASx1905R) was identified in S.pyogenes <SEQ ID 8273> which encodes the amino
acid sequence <SEQ ID 8274>. Analysis of this protein sequence reveals the following:

Possible site: 25

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -0.48 Transmembrane 38 - 54 ( 37 - 54)

Final Results

bacterial membrane --- Certainty=0.1192 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2888

A DNA sequence (GASx1911R) was identified in S.pyogenes <SEQ ID 8275> which encodes the amino
acid sequence <SEQ ID 8276>. Analysis of this protein sequence reveals the following:

Possible site: 30

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood =-10.40 Transmembrane 27 - 43 ( 22 - 48)
INTEGRAL Likelihood = -9.82 Transmembrane 52 - 68 ( 50 - 74)
INTEGRAL Likelihood = -7.27 Transmembrane 113 - 129 ( 111 - 134)
INTEGRAL Likelihood = -1.97 Transmembrane 137 - 153 ( 135 - 153)

Final Results

bacterial membrane --- Certainty=0.5161 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2889

A DNA sequence (GASx1915R) was identified in S.pyogenes <SEQ ID 8277> which encodes the amino
acid sequence <SEQ ID 8278>. Analysis of this protein sequence reveals the following:

Possible site: 31

Final Results

bacterial membrane --- Certainty=0.5310 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2890 

A DNA sequence (GASx1918R) was identified in S.pyogenes <SEQ ID 8279> which encodes the amino
acid sequence <SEQ ID 8280>. Analysis of this protein sequence reveals the following:

Possible site: 38

Final Results

bacterial membrane --- Certainty=0.3930 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2891 

A DNA sequence (GASx1923R) was identified in S.pyogenes <SEQ ID 8281> which encodes the amino
acid sequence <SEQ ID 8282>. Analysis of this protein sequence reveals the following:

Possible site: 42

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood =-12.26 Transmembrane 20 - 36 ( 13 - 42)

Final Results

bacterial membrane --- Certainty=0.5904 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2892 

A DNA sequence (GASx1926) was identified in S.pyogenes <SEQ ID 8283> which encodes the amino acid
sequence <SEQ ID 8284>. Analysis of this protein sequence reveals the following:

Possible site: 24

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2322 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2893 

A DNA sequence (GASx1928R) was identified in S.pyogenes <SEQ ID 8285> which encodes the amino
acid sequence <SEQ ID 8286>. Analysis of this protein sequence reveals the following:

Possible site: 13

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3395 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2894

A DNA sequence (GASx1929R) was identified in S.pyogenes <SEQ ID 8287> which encodes the amino
acid sequence <SEQ ID 8288>. Analysis of this protein sequence reveals the following:

Possible site: 16

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -1.86 Transmembrane 17 - 33 ( 15 - 33)

Final Results

bacterial membrane --- Certainty=0.1744 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2895 

A DNA sequence (GASx1931R) was identified in S.pyogenes <SEQ ID 8289> which encodes the amino
acid sequence <SEQ ID 8290>. Analysis of this protein sequence reveals the following:

Possible site: 31

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0551 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2896

A DNA sequence (GASx1941R) was identified in S.pyogenes <SEQ ID 8291> which encodes the amino
acid sequence <SEQ ID 8292>. Analysis of this protein sequence reveals the following:

Possible site: 16

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2377 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2897 

A DNA sequence (GASx1949) was identified in S.pyogenes <SEQ ID 8293> which encodes the amino acid
sequence <SEQ ID 8294>. Analysis of this protein sequence reveals the following:

Possible site: 29

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0262 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

WO 02/34771

PCT/GB01/04789



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2898

A DNA sequence (GASx1951R) was identified in S.pyogenes <SEQ ID 8295> which encodes the amino
acid sequence <SEQ ID 8296>. Analysis of this protein sequence reveals the following:

Possible site: 45

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1330 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2899 

A DNA sequence (GASx1953) was identified in S.pyogenes <SEQ ID 8297> which encodes the amino acid
sequence <SEQ ID 8298>. Analysis of this protein sequence reveals the following:

Possible site: 15

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2900 

A DNA sequence (GASx1957) was identified in S.pyogenes <SEQ ID 8299> which encodes the amino acid
sequence <SEQ ID 8300>. Analysis of this protein sequence reveals the following:

Possible site: 26

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2409 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2901

A DNA sequence (GASx1969) was identified in S.pyogenes <SEQ ID 8301> which encodes the amino acid
sequence <SEQ ID 8302>. Analysis of this protein sequence reveals the following:

Possible site: 14

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -2.28 Transmembrane 7 - 23 ( 7 - 23)

Final Results

bacterial membrane --- Certainty=0.1914 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2902 

A DNA sequence (GASx1971R) was identified in S.pyogenes <SEQ ID 8303> which encodes the amino
acid sequence <SEQ ID 8304>. Analysis of this protein sequence reveals the following:

Possible site: 21

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1545 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2903 

A DNA sequence (GASx1973) was identified in S.pyogenes <SEQ ID 8305> which encodes the amino acid
sequence <SEQ ID 8306>. Analysis of this protein sequence reveals the following:

Possible site: 49

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -2.44 Transmembrane 31 - 47 ( 31 - 48)

Final Results

bacterial membrane --- Certainty=0.1977 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 3 M1ISFESVILKHNKIITPEKRLFMKKTKLIFSFTSIFIAIISRPVFGLEVDNNSLLRHIY 62 

MIISFESVILKHNICI1TPEKRLFMXKTKL1FSFTSIFIAIISRPVFGLEVDNNSLLRNIY 
Sbjct: 1 MI ISFESVILKHNKI ITPEKRLFMKKTKL1 FSFTS I FIAII SRPVFGLEVDNNSLLRNIY 60 

Query: 63 STIVYEYSDWIDFKTSHNLVTKKI^VRDAPJDFFINSEmEYAANDFKDGDKIAMFSVPF 122 

STIVYEYSD VIDFKTSHNLVTKKLDVRDARDFFINSEMDEYAANDFK GDKIA+FSVPF 
Sbjct: 61 STIVYEYSDIVIDFKTSHNLVTKKLDVRnARDFFINSEMDEYAAMDFKTGDKIAVFSVPF 120 

Query: 123 DWNYLSEGKVIAYTYGGMTPYQEEPMSKNIPVNLWINRKQIPVPYNQISTNKTTVTAQEI 182 

DWNYLS+GKV AYTYGG4TPYQ+ K VNLWIN KQI VPYN+ISTNKTTVTAQEI 
Sbjct: 121 DWOTLSKGKOTAYTYGGITPYQKLQYLKISLVl^WINGKQISVPYNEISTOKTTVTAQEI 180 

Query: 183 DLKVRKFL1SQHQLYSSGSSYKSGKLVFHTNDNSDKYSLDLFYVGYRDKESIFKVYKDNK 242 

DLKVRKFLI+QHQLYSSGSSYKSG+LVFHTMDNSDKYS DLFYVGYRDKESIFKVYKDNK 
Sbjct: 181 DLKVRKFLIAQHQLYSSGSSYKSGRLVFHTNDNSDKYSFDLFYVGYRDKESIFKVYKDNK 240 

Query: 243 SFNIDKIGHLDIEIDS 258 

SFNIDKIGHLDIEIDS 
Sbjct: 241 SFNIDKIGHLDIEIDS 256 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2904 

A DNA sequence (GASx1974R) was identified in S.pyogenes <SEQ ID 8307> which encodes the amino
acid sequence <SEQ ID 8308>. Analysis of this protein sequence reveals the following:

Possible site: 53

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2022 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2905 

A DNA sequence (GASx1983) was identified in S.pyogenes <SEQ ID 8309> which encodes the amino acid
sequence <SEQ ID 8310>. Analysis of this protein sequence reveals the following:

Possible site: 14

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0989 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2906 

A DNA sequence (GASx1987) was identified in S.pyogenes <SEQ ID 8311> which encodes the amino acid
sequence <SEQ ID 8312>. Analysis of this protein sequence reveals the following:

Possible site: 34

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2389 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2907 

A DNA sequence (GASx1988) was identified in S.pyogenes <SEQ ID 8313> which encodes the amino acid
sequence <SEQ ID 8314>. Analysis of this protein sequence reveals the following:

Possible site: 48

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.5904 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB16031 GB:AB030747 transposase [Streptococcus pyogenes] 
Identities = 22/24 (91%) , Positives = 23/24 (95%) 


Query: 1 LERLFGTAKEYHNLCYTREKGKSK 24 

+ERLFGTAKEYHNL YTREKGKSK 
Sbjct: 399 IERLFGTAKEYHNLRYTREKGKSK 422 
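
For illustration only, the identity figure reported for this alignment can be recomputed from the two aligned rows. The following Python sketch is not part of the original analysis: the function name `alignment_identity` is hypothetical, and the sketch assumes the percentage is truncated rather than rounded, an inference from 22/24 being reported as 91%.

```python
def alignment_identity(query: str, subject: str) -> tuple[int, int, int]:
    """Return (identical, aligned_length, percent) for two aligned rows.

    Both rows must have the same length; '-' marks a gap and never
    counts as an identity.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = sum(
        1 for q, s in zip(query, subject) if q == s and q != "-"
    )
    length = len(query)
    # Assumption: truncate the percentage (22/24 -> 91); rounding would give 92.
    percent = identical * 100 // length
    return identical, length, percent


# Rows copied from the transposase alignment above (Query 1-24, Sbjct 399-422).
query = "LERLFGTAKEYHNLCYTREKGKSK"
subject = "IERLFGTAKEYHNLRYTREKGKSK"
identical, length, percent = alignment_identity(query, subject)
print(f"Identities = {identical}/{length} ({percent}%)")
```

The two mismatches fall at alignment positions 1 (L/I) and 15 (C/R). The "Positives" count is not reproduced here because it depends on the scoring matrix used by the search program.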

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2908

A DNA sequence (GASx1990R) was identified in S.pyogenes <SEQ ID 8315> which encodes the amino
acid sequence <SEQ ID 8316>. Analysis of this protein sequence reveals the following:

Possible site: 32

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2909 

A DNA sequence (GASx1991) was identified in S.pyogenes <SEQ ID 8317> which encodes the amino acid
sequence <SEQ ID 8318>. Analysis of this protein sequence reveals the following:

Possible site: 53

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -0.16 Transmembrane 2 - 18 ( 1 - 18)

Final Results

bacterial membrane --- Certainty=0.1065 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2910 

A DNA sequence (GASx1994) was identified in S.pyogenes <SEQ ID 8319> which encodes the amino acid
sequence <SEQ ID 8320>. Analysis of this protein sequence reveals the following:

Possible site: 40

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -1.44 Transmembrane 28 - 44 ( 28 - 44)

Final Results

bacterial membrane --- Certainty=0.1574 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2911 

A DNA sequence (GASx1996) was identified in S.pyogenes <SEQ ID 8321> which encodes the amino acid
sequence <SEQ ID 8322>. Analysis of this protein sequence reveals the following:

Possible site: 13

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1076 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2912 

A DNA sequence (GASx1997R) was identified in S.pyogenes <SEQ ID 8323> which encodes the amino
acid sequence <SEQ ID 8324>. Analysis of this protein sequence reveals the following:

Possible site: 61

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -7.96 Transmembrane 53 - 69 ( 49 - 75)
INTEGRAL Likelihood = -2.34 Transmembrane 24 - 40 ( 24 - 43)

Final Results

bacterial membrane --- Certainty=0.4185 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2913 

A DNA sequence (GASx2007R) was identified in S.pyogenes <SEQ ID 8325> which encodes the amino 
acid sequence <SEQ ID 8326>. Analysis of this protein sequence reveals the following: 

Possible site: 55 

10 N-terminal signal sequence 

46 - 62 ( 43 - 65) 



Final Results

bacterial membrane --- Certainty=0.3654 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAB97959 GB:U96166 ATP-binding cassette lipoprotein 
[Streptococcus cristatus] 
Identities = 37/60 (51%) , Positives = 42/60 (69%) , Gaps = 1/60 (1%) 

Query: 59 FLTACGTmOOSKKEEVKEIKMSDIKIlDAVSKKTICVvDGEEVTEYTTKDGIWIQIPAGNEE 118 

FL ACG+K KE + + K D K DAV +KTK VDG+EVTEYT DGNVIQIPA EE 
Sbjct: 12 FIAA.CGSKNADNKE-ISDGKKTOFKIOAVDQKTKTVDGKEVTEYTMPDGNVIQIPADGEE 70 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2914 

A DNA sequence (GASx2009) was identified in S.pyogenes <SEQ ID 8327> which encodes the amino acid 
sequence <SEQ ID 8328>. Analysis of this protein sequence reveals the following: 

Possible site: 41 

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.1246 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2915 

A DNA sequence (GASx2010) was identified in S.pyogenes <SEQ ID 8329> which encodes the amino acid 
sequence <SEQ ID 8330>. Analysis of this protein sequence reveals the following: 
Possible site: 17 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2549 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2916 

A DNA sequence (GASx2012R) was identified in S.pyogenes <SEQ ID 8331> which encodes the amino 
acid sequence <SEQ ID 8332>. Analysis of this protein sequence reveals the following: 

Possible site: 28 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3307 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAA27007 GB:L26141 pyrogenic exotoxin B [Streptococcus pyogenes]
Identities = 40/102 (39%) , Positives = 57/102 (55%) , Gaps = 7/102 (6%) 

Query: 2 EMHFWTEPFARRIAETFCAENTQTK3TMRVQQLSYPSDTDHSGGEL YIYALSPA 56 

+ +F R E EA+ A TF ++ K R + D + GGEL YIY +S 

Sbjct: 28 DQNFARNEKEAKDSAITFIQKSAAIKAGARSAE-DIKLDKVNLGGELSGSNMYIYNISTG 86 

Query: 57 GFIIVSGDTRAHTILGYSFDNNLDLN-HDNVRSMIEAYQKQI 97 

GF+IVSGD R+ ILGYS + D+N +N+ S +E+Y 4QI 
Sbjct: 87 GFVIVSGDKRSPEILGYSTSGSFDVNGKENIASFMESYVEQI 128 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 
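
Each alignment above is summarised by a header of the form "Identities = 40/102 (39%) , Positives = 57/102 (55%) , Gaps = 7/102 (6%)". As an illustration only, the sketch below parses such a header and checks each printed percentage against its fraction. It assumes that the percentages are truncated rather than rounded, which is what the figures in this header suggest (57/102 is printed as 55%); the function names are invented for this sketch. A mismatch may simply indicate a transcription error in the printed header.

```python
import re

# Matches e.g. "Identities = 40/102 (39%)" with optional stray spaces.
STAT_RE = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)\s*\(\s*(\d+)\s*%\s*\)"
)


def parse_alignment_stats(header):
    """Return {name: (count, aligned_length, printed_percent)}."""
    return {
        name: (int(num), int(den), int(pct))
        for name, num, den, pct in STAT_RE.findall(header)
    }


def floor_percent(num, den):
    """Percentage truncated to an integer (assumed reporting convention)."""
    return num * 100 // den


def is_consistent(stats):
    """True if every printed percentage matches its truncated fraction."""
    return all(floor_percent(n, d) == p for n, d, p in stats.values())


header = "Identities = 40/102 (39%) , Positives = 57/102 (55%) , Gaps = 7/102 (6%)"
stats = parse_alignment_stats(header)
print(stats, is_consistent(stats))
```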

Example 2917 

A DNA sequence (GASx2013R) was identified in S.pyogenes <SEQ ID 8333> which encodes the amino 
acid sequence <SEQ ID 8334>. Analysis of this protein sequence reveals the following: 
Possible site: 22 

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2918 

A DNA sequence (GASx2014R) was identified in S.pyogenes <SEQ ID 8335> which encodes the amino 
acid sequence <SEQ ID 8336>. Analysis of this protein sequence reveals the following: 

Possible site: 44 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1392 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2919 

A DNA sequence (GASx2015) was identified in S.pyogenes <SEQ ID 8337> which encodes the amino acid 
sequence <SEQ ID 8338>. Analysis of this protein sequence reveals the following: 

Possible site: 35 



Final Results 

bacterial membrane --- Certainty=0.1702 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2920 

A DNA sequence (GASx2018) was identified in S.pyogenes <SEQ ID 8339> which encodes the amino acid 
sequence <SEQ ID 8340>. Analysis of this protein sequence reveals the following: 

Possible site: 29 

10 N-terminal signal sequence 

23 - 39 ( 22 - 40) 



Final Results 

bacterial membrane --- Certainty=0.3336 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2921 

A DNA sequence (GASx2019) was identified in S.pyogenes <SEQ ID 8341> which encodes the amino acid
sequence <SEQ ID 8342>. Analysis of this protein sequence reveals the following: 

Possible site: 26 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0669 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAC98898 GB:AF023179 low temperature requirement C protein 
[Listeria monocytogenes] 
Identities = 95/144 (65%), Positives = 117/144 (80%) 

Query: 15 LAERGVSLFAIAELVLFLQNDYI PNLTMAECLES VEAVIAKREVQNAI ITGVELDKLAEA 74 

L ERGV ++ IAELVLFLQ Y P L + C ++VE VL KREVQNA++TG++LD +AE 
Sbjct: 16 L IERGVE 1 DDIAELVLFLQQKYHPGljELDI CRQ^'EHVLRKREVQNAVLTG I QLDVMAEK 75 

Query: 75 NQLSEPLLSILKTDQGLYGIDEILALSIVNLYGSIGFTNYGYLDKTKPGIVDKLNHKDGY 134 

+L +PL +1+ D+GLYG+DE I LALS I VN+ YGS IGFTNYGY4DK KPGI+ KLN DG 
Sbjct: 76 GELVQPLQNIISADEGLYGVDEIIJ^SIVNVYGSIGFTI^GYIDKVKPGILAKIiNEHDGI 135 

Query: 135 SCHTFLDDI VSAIAAAAASRIAHN 158 

+ HTFLDDIV AIAAAAASR4AH+ 
Sbjct: 136 AVHTFLDDIVGAIAAAAASRLAHS 159 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 



Example 2922

A DNA sequence (GASx2030) was identified in S.pyogenes <SEQ ID 8343> which encodes the amino acid 
sequence <SEQ ID 8344>. Analysis of this protein sequence reveals the following: 
Possible site: 18 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0320 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2923 

A DNA sequence (GASx2031) was identified in S.pyogenes <SEQ ID 8345> which encodes the amino acid 
sequence <SEQ ID 8346>. Analysis of this protein sequence reveals the following: 

Possible site: 24

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0583 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2924 

A DNA sequence (GASx2032R) was identified in S.pyogenes <SEQ ID 8347> which encodes the amino 
acid sequence <SEQ ID 8348>. Analysis of this protein sequence reveals the following: 

Possible site: 53 



Final Results 

bacterial membrane --- Certainty=0.2105 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

A related GBS gene <SEQ ID 8467> and protein <SEQ ID 8468> were also identified. Analysis of this
protein sequence reveals the following:

Lipop: Possible site: -1 Crend: 10 
McG: Discrim Score: -11.19 
GvH: Signal Score (-7.5): -4.94 

Possible site: 49 
>>> Seems to have no N-terminal signal sequence 
ALOM program count: 1 value: -4.19 threshold: 0.0 

INTEGRAL Likelihood = -4.19 Transmembrane 25 - 41 ( 25 - 42) 
PERIPHERAL Likelihood = 13.26 41 
modified ALOM score: 1.34 

*** Reasoning Step: 3 

Final Results 

bacterial membrane --- Certainty=0.2678 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

The protein has homology with the following sequences in the databases: 

ORF01616(304 - 429 of 771) 

SP|O06442|SECE_STAAU(7 - 48 of 60) PREPROTEIN TRANSLOCASE SECE 
GP|2078376|gb|AAB54017.1||U96619 SecE [Staphylococcus aureus]
%Match =5.4 

%Identity = 26.2 %Similarity =57.1 

Matches = 11 Mismatches = 18 Conservative Sub.s = 13 

99 129 159 189 219 249 279 309 

RIIQIMLK*HLWRRYGTKESKPSVYRMRKPJ01iIjNRSK*HPQAOTTRSK*IL*IL*EvYNTQRNALI*RNKLQKGELIMFV 

I 

MAKKESFF 

339 369 399 429 459 489 519 549

KGIFQVLRDTTWPNRKQRWKDFISILEYWFFTIVIYIFDKLIiAAGVMDLINRF***IILDRNNPNP*ILLRVFCVENNI 
11= = 1=11 =1 = :: =|| : | :| 

KGVKSEMEKTSWPTKEELFKYTVIWSTVIFFLVFFYALDLGITALKNLLB'G 

SEQ ID 8468 (GBS396) was expressed in E.coli as a GST-fusion product. SDS-PAGE analysis of total cell
extract is shown in Figure 83 (lane 9; MW 35kDa). 

GBS396-GST was purified as shown in Figure 217, lane 8. 

Example 2925

A DNA sequence (GASx2034R) was identified in S.pyogenes <SEQ ID 8349> which encodes the amino 
acid sequence <SEQ ID 8350>. Analysis of the protein sequence reveals the following: 

Possible site: 21 

10 N- terminal signal sequence 

Transmembrane 53 - 69 ( 53 - 70)

Final Results 

bacterial membrane --- Certainty=0.1235 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2926 

A DNA sequence (GASx2035) was identified in S.pyogenes <SEQ ID 8351> which encodes the amino acid
sequence <SEQ ID 8352>. Analysis of this protein sequence reveals the following: 

Possible site: 39 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2928 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2927 

A DNA sequence (GASx2042R) was identified in S.pyogenes <SEQ ID 8353> which encodes the amino 
acid sequence <SEQ ID 8354>. Analysis of this protein sequence reveals the following: 

Possible site: 44 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2547 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2928 

A DNA sequence (GASx2043) was identified in S.pyogenes <SEQ ID 8355> which encodes the amino acid 
sequence <SEQ ID 8356>. Analysis of this protein sequence reveals the following: 

Possible site: 26

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3289 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2929 

A DNA sequence (GASx2049) was identified in S.pyogenes <SEQ ID 8357> which encodes the amino acid 
sequence <SEQ ID 8358>. Analysis of this protein sequence reveals the following:

Possible site: 13 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4014 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2930 

A DNA sequence (GASx2052) was identified in S.pyogenes <SEQ ID 8359> which encodes the amino acid
sequence <SEQ ID 8360>. Analysis of this protein sequence reveals the following: 

Possible site: 40 

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2931 

A DNA sequence (GASx2055R) was identified in S.pyogenes <SEQ ID 8361> which encodes the amino
acid sequence <SEQ ID 8362>. Analysis of this protein sequence reveals the following: 

Possible site: 32 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3048 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB05703 GB:AP001513 imidazolonepropionase
(imidazolone-5-propionate hydrolase) [Bacillus halodurans]
Identities = 203/416 (48%) , Positives = 278/416 (66%) , Gaps = 4/416 (0%) 

Query: 11 DVLLTHFNQLFCLNDPGHPLTGQEMKKATIVEDGYIAIKDGLIVALGSGEPDAELVGTQT 70 

D LL + QL + G P G+EM + ++E + I+DG + +G+ Q 
Sbjct: 6 DTLLWIGQLLPMESKG-PKRGKEMSELQLLEHAALGIRDGIWAFIGTMVEADTFTANQM 64 

Query: 71 IMRSYKGKIATPGIIDCHTHLWGGSREHEFAKKLAGVSYLDILAQGGGILSTVRATRSA 130 

I +GK+ TPG++D HTHL+ +GGSREHE A K GV YL+IL GGGIL+TV ATR+A 
Sbjct: 65 I - -DCQGKL VTPGL VDPHTHLI FGGSREHEMALKQQGVPYLE I LKNGGGI LATVEATRAA 122 

Query: 131 SFDNLYQKSKRLLDYMLLHGVTTVEaKSGYGIiDWETEKRQLDWAALEKDHPIDLVSTFM 190 

S + L K+ L4 ML +GVTT+EAKSGYGLD ETE +QL A+ + HPID+VSTF+ 
Sbjct: 123 SEEELITKAICHLNRMLSYGVTTIEAKSGYG^QRETEWKQLRAAKAVGEQHPIDIVSTFL 182 

Query: 191 AAHAIPEEYKGNPKAYLDVIIKDMLPWKEENLAEFCDIFCEKNVFTADESRYLLSKAKE 250 

AHAIP ++ +P +LD + DML +KE+NLAEF DIF E VFT ++SR L KAKE 
Sbjct: 183 GAHAI PTSHRNDPDRFLDEMA-DMLGE I KEQNIAEFVD I FTETGVFTVEQSRTFLQKAKE 241 

Query: 251 MGFKLRIHADEIASIGGVDVAAELSAVSAEHIJyMITDrx31AKLIGffiGVlGNIiLPATTFSL 310 

GF L++HADEI +GG ++A EL A+SA+HL+ +D GI K+ AG I LLP TTF L 
Sbjct: 242 RGFGLKLHADEIDPLGGAEIAGELGAISADHLVGASDQGIQKMAAAGTIACLLPGTTFYL 301 

Query: 311 MEDTYAPARKMIDAGMAITLSTDSNPGSCPTAl^QFVMQLGCFMLRLTPIEVLNAvTINA 370 

+DTYA AR MID G+A+T+STD NPGS PT N+Q +M + L++TP E+ +AVT+N 
Sbjct: 302 GKDTYARARDMIDCGIA.VTISTDFNPGSSPTENLQLIMSIAALRLKMTPEEITOIAVTVNG 361 

Query: 371 AYSVNRQERVGSLTVGKEADIAIFDAPNIDYPFYFFATNLIHQVYKKGQLTVDRGR 426 

A+++ R + G L VG+ AD+ ++DA N Y Y + N +H V+KKG++ +R R 
Sbjct: 362 AHAIGRGDTAGQl^VGRAADWVraDAKNYYYWYHYG 417 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2932

A DNA sequence (GASx2056) was identified in S.pyogenes <SEQ ID 8363> which encodes the amino acid 
sequence <SEQ ID 8364>. Analysis of this protein sequence reveals the following: 

Possible site: 14 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1847 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB61139 GB:AL132952 predicted using Genefinder-cDNA EST
yk155e6.3 comes from this gene-cDNA EST yk155e6.5 comes
from this gene-cDNA EST yk156d6.5 comes from this
gene-cDNA EST yk259b10.3 comes fr
Identities = 302/649 (46%) , Positives = 419/649 (64%) , Gaps = 17/649 (2%)

Query: 29 EGIRRAPDRGFRLTQAQTEIALKNALRYVPTKFHEEVIPEFLEELKTRGRIYGYRFRPKD 88 

+ + AP R LTQ + +A++NALRY+P + H + EF EEL T G IYGYRF P 
Sbjct: 85 KNVAHAPmPCWLTQTEmiAVRNALRYIPKEHHVLLATEFAEELNTYGHIYGYRFMPNF 144 

Query: 89 RIYGKPIDEYKGNCTAAKAMQVMIDNNLSFEIALYPYELVTYGETGSVCANWMQYCLIKK 148 

++ P+ E +C A A+ +MI NNL +A +P ELVTYG G V +NW+Q+ L+ + 
Sbjct: 145 DLFAPPVSEIGAHCEQASAIILMILNNI.DKRVAQFPQELVTYGGNGQVFSNWIQFRLVLR 204 

Query: 149 YLEVMTDEQTLVVESGHPVGLFKSKPEAPRVIITNGI^VGEYDNMKDWEIAEEMGVTNYG 208 

YL MTD QTLV+ SGHP+GLF S P++PR+ +TNG+++ Y + ++ +GVT YG 
Sbjct: 205 YLYTMTDHQTLVLYSGHPLGLFPSTPDSPRmVTNGMMIPSYSTKELYDKYFALGVTQYG 264 

Query: 209 QMTAGGWT^IGPQGIVHGTFNTLl^AGPiKI^VADDGDLTGKLFISSGLGGMSGAQGJaA 268 

QMTAG + YIGPQGIVHGT T+LNAGR ++G+ L GK+F+++GLGGMSGAQ KAA 

Sbjct: 265 QMTAGSFCYIGPQGIVHGTTITVLNAGR-RMGL DSLAGKVFVTAGLGGMSGAQPKAA 320 

Query: 269 EIAKAVAIIAEVDQSRIKTRHSQGWISQIAESPEEALQLAQKAIDAKESTSIAYHGNIVD 328 

+1A + +IAE+ + + RH QGW+ ++ EE + ++ + KE+ SI Y GN+VD 
Sbjct: 321 KI AGC IGVI AE I SDTALLKRHQQGWLD VY'SKDLEE I VNW I KEYREKKEAIS I GYLGNWD 380 

Query: 329 LLE-YAOTOKQIHvDLLSDQTSCHNVYDGGYCPVGISFDERTRLLAEDKDTFHQMVDDTLA 387 

L E + + V+L SDQTS HN + GG+ P G++F++ +++ D F ++V ++L 
Sbjct: 381 LWERLAEEPECLVELGSDQTSLHNPFLGGFYPAGLTFEQSNQMMTSDPVKFKKLVQNSLI 440 



R AI + G YF+DYGNAF+ +G + ++ ++DK F +PSY++DIMG + 

Sbjct: 441 RQIAAIDKIAAKGMYFWDYGNAFIjLECQRAGANLIjREDAQDDK-SFRYPSYMQDIMGD-I 498 

Query: 448 FDYGYGPFRWVCLSGNHDDLVATDKAAMEAIDPDR RYQDRDNYNWIRDAEKN 499 

F G+GPFRWVC SG +DL TD+ A + ID + + Q DN WI +AEKN 

Sbjct: 499 FSMGFGPFRWVCTSGKPEDLRLTDQTACKIIDELKDTDVPEYVKQQYLDNKKWIEEAEKN 558 

Query: 500 QLWGTQARILYQDCIGRVTIALKFNELVRKGKI-GEVMIGRDHHDVSGTDSPFRETSNI 558 

+LWG+QARILY D GRV +A FNELV+ GK+ ++I RDHHDVSGTDSPFRETSN+ 
Sbjct: 559 KLWGSQARILYSDRAGRVALASAFNELVKSGKVSAAIVISRDHHDVSGTDSPFRETSNV 618 

Query: 559 KDGSNVTCDMAVQCYAGNAARGMSLVALHNGGGTGIGKAINGGFGLVLDGSERIDEIIKS 618 

DGS T DMAVQ G++ RG + VALHNGGG G G INGGFG+VLDGS + 
Sbjct: 619 YDGSAFTADMAVQNCIGDSFRGATWVALmiGGGVGWGDVINGGFGIVLDGSSDAARRAEG 678 

Query: 619 AIAWDTMGGVARRNWARNEHAIETAIEYIIRLHAGTDHITIPYLADDDLV 667 

+ WD GV RR+W+ N A E AI4 +T+P AD++L+ 

Sbjct: 679 MIjNWDVPNGVTRRSWSGNAKAQE-AIQRAEKQVDGLRVrLPVEADEELL 726 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2933 

A DNA sequence (GASx2057) was identified in S.pyogenes <SEQ ID 8365> which encodes the amino acid 
sequence <SEQ ID 8366>. Analysis of this protein sequence reveals the following: 

Possible site: 44 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1887 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAD35925 GB:AE001751 formiminotransferase-cyclodeaminase/formiminotetrahydrofolate
cyclodeaminase, putative [Thermotoga maritima]
Identities = 160/295 (54%), Positives = 214/295 (72%), Gaps = 2/296 (0%) 

Query: 3 KIVECIPNFSEGQNQAVIDGLVATAKSIPGVTLLDYSSrjRSHHESVETLVGDDQSIQEAA 62
K++E +PNFSEG+ + V++ +VA AK V +LD+S DA HNRSV TLVG+ +++ A
Sbjct: 2 KLIESVPNFSEGE^KEVVEKIVAEaUOCYDRVWVLDWSMDADKNRSVITLVGEPENLlNAL 61

Query: 63 FQLVKyASEKIDMTKHHGEHPRMGATDVCPFVPIKDITTQECVEISKQVAERINRELGIP 122
F + K A+E ID+ H G+HPRMGA DV P VP+ + T +ECVE SK + RI ELGIP
Sbjct: 62

Query: 123
++LYE SATRPERQNLA +RKG+FEG EK+ + W PD+G ++HPTAGVTAVGAR L
Sbjct: 122

Query: 183 :DIAHKIAKI IRGSGGGYKYCKAIGVMLEDF
WNL T ++ IA KIA+ IR S GG +Y KAIGV L+ f
Sbjct: 182 JVNLGTRDVKIAEKlARAIRFSSGGLRYVKAIGvDLKCT

Query: 243 STIKFEARRyGvNVIGSEVIGLAPAKALIDVAEYYLQVI
I IK EA RYGV V+GSE++GL P ++L+ YYL+ + K+++E++LL
Sbjct: 242 3LIKMEAERYGVPVLGSEIVGLFPLESLLKTV3YYLRTD- -LNAKKVIESNLL 295

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2934 

A DNA sequence (GASx2058) was identified in S.pyogenes <SEQ ID 8367> which encodes the amino acid 
sequence <SEQ ID 8368>. Analysis of this protein sequence reveals the following: 

Possible site: 31 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2776 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>




No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAA62653 GB:L33465 methenyl tetrahydrofolate cyclohydrolase
[Methylobacterium extorquens] 
Identities = 79/198 (39%) , Positives = 112/198 (55%) 

Query: SLTDFAKVLGSDAPAPGGGSAAALSGANGISLTHvIVCELTLGKKKYADYQDIITEIHAKS 65
+4- F L S AP PGGG AAA+SGA G +L MVC LT+GKKKY + + + ++ KS
Sbjct: TIETFLDGLASSAPTPGGGGAAAISGAMGAALVSNIVCNLTIGKKKYVEVEADLMQVLEKS 65

Query: 67 TALQASLLAAIDKDTEAFNLVSAVFDMPKETDBDKAARRTAMQKALKTAAQSPFEMMTLM 126
L+ +L ID EAF+ V + +PK TDE+KAAR +Q+ALKTA P +
Sbjct: 66

Query: 127
E +++ K N N SD GVA h+ AGL+ A LNV +N G+ D F + ++
Sbjct: 126 lEVIDIAEIVAEKG>nLNVISDAGVAVLSAYAGLRSAAGNVYVNAKGLDDRAFAEERLKEL 185

Query: 187 JALLDKGCHLADDIYTKI 204
LL + L + IY +
Sbjct: 186 iGLLAEAGALNERIYETV 203

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2935 

A DNA sequence (GASx2061) was identified in S.pyogenes <SEQ ID 8369> which encodes the amino acid 
sequence <SEQ ID 8370>. Analysis of this protein sequence reveals the following: 
Possible site: 22 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3924 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2936 

A DNA sequence (GASx2063) was identified in S.pyogenes <SEQ ID 8371> which encodes the amino acid 
sequence <SEQ ID 8372>. Analysis of this protein sequence reveals the following: 

Possible site: 57 

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -1.06 Transmembrane 231 - 247 ( 231 - 247)
INTEGRAL Likelihood = -0.53 Transmembrane 2 - 18 ( 1 - 18)

Final Results 

bacterial membrane - 
bacterial outside - 
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

= 2/477 (0%) 

Query: 42 VINLDGESLTIEDVIAIARQGVACHIDDSAIEAVNASRKIVDDIVSEKRWYGVTTGFGS 101 

++ LDG SLT DV + + ++E V SR V+ IV +++ +YG+ TGFG 

Sbjct: 1 MVTLIX5SSLTTADVARVLFDFEEAAASEESMERVKKSRAAVERIVRDEKTIYGINTGFGK 60

Query: 102 LCNVSISPEDTVQLQENLIRTHASGFGDPLPEDAVRAIMLIRINSLVKGYSGIRLSTIEK 161

+V 1 ED+ LQ NLI +HA G GDP PE RA++L+R N+L+KG+SG+R IE+ 
Sbjct: 61 FSDVLIQKEDSAALQLNLILSHACGVGDPFPECVSRAMLLLRANALLKGFSGVRAELIEQ 120 

Query: 162 LLELLNKGVHPYIPEKGSLGASGDIAPLAHMVLPMLGLGKAYYKGELLSGQEALDKAGID 221 

LL LNK VHP IP++GSLGASGDLAPL+H+ L ++G G+ +++GE + L KAGI 

Sbjct: 121 LLAFLNKRWPVIPQCGSLGASGDLAPIjSHLALALIGQGEVFFEGERMPAMTGLKKAGIQ 180 

Query: 222 KISIAAKEGl^INGTTVLTAVGAIATYDAIQIiKLSDIAGALSLEVHNGITSPFEFJJLH 281 

++L +KEGLALINGT +TA+G +A +A +L ++ +L++E GI F+E++H 
Sbjct: 181 PVTLTSKEGLALINGTCAMTAMGWAYIEAEKIAYQTERIASLTIEGLQGIIDAFDEDIH 240 

Query: 282 TIRPQSGQIATARNIRNLLEGSQNTTVATQSRVQDPYTLRCMPQIHGASKDSIAYVKSKV 341 

R Q+ A IR L S TT 4 RVQD Y+LRC+PQ+HGA+ ++ YVK K+ 
Sbjct: 241 LARGYQEQIDVAERIRFYLSDSGLTTSQGEI.RVQDAYSLRCIPQVHGATWQTLGYVKEKL 300 

Query: 342 DIEINSWDNPIICKIX3-HVIS<MNFHGEPMAQPFDFIjGIAISEIGNVSERRVERLWSQ 400 

+ IE+N+ TDNP+I DG VISGGNFHG+P+A DFL IAISE+ N++ERR+ERLVN Q 
Sbjct: 301 EIEMNAATDNPLIFNDGDKVISGGNFHGQPIAFAMDFLKIAISELANIAERR1ERLVNPQ 360 

Query: 401 LSKLPSFLVKYPGI^SGFMITQYACASIASENKVLAHPASVDSIPSCENQEDFVSMGTTA 460 

L+ LP FL +PGL. SG MI QYA ASL SENK LAHPASVDSIPS NQED VSMGT A 
Sbjct: 361 LNDLPPFLSPHPGLQSGAMIMQYAAASLVSENKTLAHPASVDS I PSSANQEDHVSMGTIA 420 

Query: 461 ARKAFE I LKNSRRI VATE IMAACQALDLKPENHELGKGTKVAYDLFRKEVNF IEHDK 517 

AR A++++ N+RR++A E + A QA++ + H TK + RK V 1+ D+ 

Sbjct: 421 ARHAYQVIANTRRVIAIEAICALQAVEYRGIEH-AASYTKQLFQEMRKWPSIQQDR 476 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2937 

A DNA sequence (GASx2064) was identified in S.pyogenes <SEQ ID 8373> which encodes the amino acid 
sequence <SEQ ID 8374>. Analysis of this protein sequence reveals the following: 

Possible site: 44 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4483 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAG06563 GB:AE004741 probable arginase family protein 
[Pseudomonas aeruginosa] 
Identities = 99/275 (36%) , Positives = 147/275 (53%) , Gaps = 9/275 (3%) 

Query: 53 LIGFKSDKGVYimGRVGA.VESPaAIRTQLAKFPWHLGNQVMVYDVGNIDGPNRSLEQLQ 112

L+GF SD+GV N GR GA P A+R LA. WH G Q +YD G+I + LE Q 
Sbjct: 42 LLGFASDEGVRRNQGRQGARHGPPAIJ^PJU^IILAWH-GEQA-IYDAGDIVAGD-DLEAAQ 98 

Query: 113 NSLSKAI KRMCDUSILKP I VLGGGHETAYGHYLGLRQSIiSPSDDIj AVINMDAHFDLRP 169 

++ + + + + LGGGHE AY + GL + LS + L ++N DAHFDLR 

Sbjct: 99 ECYAQRVADLLACGHRWGLGGGHEIAYASFAG1ARHLSRHERLPRIGILNFDAHFDLRH 158 

Query: 170 YDQTGPNSGTGFRQMFDDAVADKSLFKYFVLGIQEHNNKLFLFDFVAKSKGIQFLTGQDI 229 

++ +SGT FRQ+ + A F Y LGI +N LFD A+ G+++L + + 
Sbjct: 159 AERA--SSGTPFRQIAELCQASDWPFAYCCLGISRLSNTAALFD-QAQRLGVRYLLDRQL 215 

Query: 230 YQMGHQKVCRAIDRFLEGQERVYLTIDMDCFSVGAAPGVSAIQSLGVDPNLAVLVLQHIA 289 

++ +D FL+ + +YLT+ +D APGVSA + GV+ + +++ 

Sbjct: 216 QPWNLERSEAFLDGFLQSVDHLYLTVCLDVLPAAQAPGVSAPSAHGVEMPWEHLVRRAK 275 



Sbjct: 276 ASGKLRLADIAELNPQLDSDQRTARIAARLVDSLV 310 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2938 

A DNA sequence (GASx2065R) was identified in S.pyogenes <SEQ ID 8375> which encodes the amino 
acid sequence <SEQ ID 8376>. Analysis of this protein sequence reveals the following: 

Possible site: 27 



Final Results

bacterial membrane --- Certainty=0.1150 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB37582 GB:AL035569 putative regulatory protein [Streptomyces
coelicolor A3(2)]

Identities = 95/437 (21%) , Positives = 177/437 (39%) , Gaps = 28/437 (6%) 

Query: 271 EVGALLLIGDTGIGKRTLARQVLANQTQTFQI VTAKCFREEAMDSL- -LPWRNILDGLGD 32 !
E ALLL G+ G+GK L + A + +V EDLP+LL
Sbjct: 95

Query: 329
Sbjct: 155

Query: 388
Sbjct: 215

Query: 441
Sbjct: 274

Query: 501
Sbjct: 329
+V+++ED HW D

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Query: 561 NLGHYYILVEESVGEEVLISFRQRIIQLYSYDRLSLSKRRLLHGQIAKRLEDLLPILTPS 620 

+ + IL+ G+ FR +++ D L +R L+ + A+ L D PL P+ 

Sbjct: 389 SAVNANILLPAPDGDG- -YRFRHSLVREAVGDDLjPSERSRLNRRYAEAL-DADPTLVPA 445 

Query: 621 PHBLDDIAYHYQESRQVIKALEYNLNYLDATLPFQHELFPIYSKSIGSLEKSDRDHQRLM 680 

+ +A ++ + KAL LDA++ + YS+ + LE++ L 
Sbjct: 446 AERVMRLAS YWYHAHAPAKALP AVLDASVEARRR — HAYSEQLRIiIiERA MELW 496 

Query: 681 EEQFDKIRQSIADLELT 697 

Sbjct: 497 DSAPDDVRATLRPVDCT 513 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2939 

A DNA sequence (GASx2072) was identified in S.pyogenes <SEQ ID 8377> which encodes the amino acid 
sequence <SEQ ID 8378>. Analysis of this protein sequence reveals the following: 
Possible site: 14

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3702 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 
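
The "Final Results" blocks in these Examples all follow the same three-line pattern, so they can be read into a small table. The sketch below is one hedged way to do that; it tolerates stray spaces inside the certainty value, and the names `parse_localization` and `most_likely` are hypothetical helpers rather than part of any analysis tool used here.

```python
import re

# One localisation line, e.g.
# "bacterial cytoplasm --- Certainty=0.3702 (Affirmative) < succ>"
# \W* skips the dashes/spaces between the compartment name and "Certainty".
LOC_RE = re.compile(
    r"bacterial\s+(cytoplasm|membrane|outside)\W*Certainty\s*=\s*"
    r"([0-9][0-9 .]*?)\s*\(\s*(Affirmative|Not Clear)\s*\)"
)


def parse_localization(text):
    """Return {compartment: (certainty, verdict)} for a Final Results block."""
    results = {}
    for m in LOC_RE.finditer(text):
        # Remove spaces that may appear inside the number, e.g. "0 . 3702".
        certainty = float(m.group(2).replace(" ", ""))
        results[m.group(1)] = (certainty, m.group(3))
    return results


def most_likely(text):
    """Return the compartment with the highest certainty, or None if none parsed.

    If every certainty is 0.0000 the first listed compartment is returned,
    so callers should inspect the verdicts before relying on the answer.
    """
    results = parse_localization(text)
    if not results:
        return None
    return max(results, key=lambda name: results[name][0])


if __name__ == "__main__":
    block = """
    bacterial cytoplasm Certainty=0 . 3702 (Affirmative) < succ>
    bacterial membrane Certainty=0. 0000 (Not Clear) < succ>
    bacterial outside --- Certainty=0. 0000 (Not Clear) < succ>
    """
    print(parse_localization(block))
    print(most_likely(block))
```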

Example 2940 

A DNA sequence (GASx2074R) was identified in S.pyogenes <SEQ ID 8379> which encodes the amino 
acid sequence <SEQ ID 8380>. Analysis of this protein sequence reveals the following: 

Possible site: 37 



Final Results 

bacterial membrane --- Certainty=0.1362 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2941

A DNA sequence (GASx2075R) was identified in S.pyogenes <SEQ ID 8381> which encodes the amino 
acid sequence <SEQ ID 8382>. Analysis of this protein sequence reveals the following: 

Possible site: 25 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3545 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2942 

A DNA sequence (GASx2076R) was identified in S.pyogenes <SEQ ID 8383> which encodes the amino 
acid sequence <SEQ ID 8384>. Analysis of this protein sequence reveals the following: 

Possible site: 34

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2340 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAC44494 GB:U44893 orf108; unknown function [Butyrivibrio fibrisolvens]
Identities = 42/75 (56%), Positives = 55/75 (73%)

Query: 1 LLKGTLRFGQLKSSIGSVSQKVLTAQLRAKEADGLVHREVYAEVPPRVEYSLTETGLSLA 60
 LL RF +LK+++ +SQKVLT LR+ME DG++ R VY EVPPRVEYSL+E G S+
Sbjct: 31 LliWPWRFtlELKNNLEGISQKVLTDStjRSMEEDGIITRTVYPEVPPRVEYSIjSELGESMR 90

Query: 61 PVIEAMSDWGQTYQE 75
 P+I+AM WG Y+E
Sbjct: 91 PIIKAMEQWGTEYKE 105

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2943

A DNA sequence (GASx2097) was identified in S.pyogenes <SEQ ID 8385> which encodes the amino acid 
sequence <SEQ ID 8386>. Analysis of this protein sequence reveals the following: 

Possible site: 40

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -3.40 Transmembrane 26 - 42 ( 23 - 44)

Final Results

bacterial membrane --- Certainty=0.2359 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics.

Example 2944 

A DNA sequence (GASx2098) was identified in S.pyogenes <SEQ ID 8387> which encodes the amino acid 
sequence <SEQ ID 8388>. Analysis of this protein sequence reveals the following: 

Possible site: 20 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1385 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2945 

A DNA sequence (GASx2100) was identified in S.pyogenes <SEQ ID 8389> which encodes the amino acid 
sequence <SEQ ID 8390>. Analysis of this protein sequence reveals the following: 

Possible site: 23

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2138 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAA98589 GB:L44593 ORF79; putative [Lactococcus lactis phage BK5-T]
Identities = 34/62 (54%), Positives = 44/62 (70%)

Query: 3 QITLKAARINAGYTLKQVAGAVGKNPQTISKYEKDSSDISLGLLQKLSSLYGVTIDNLFL 62
 +I LKAAR NA ++ K+VA VGKN QTI YEKDS++I + L KL+ +Y ID +FL
Sbjct: 8 KIKLKAARTNADFSAKEVAEIVGKNYQTILSYEOSTEIPMSIAIKLAEIYDYPIDFIFL 67

Query: 63 GK 64
 GK
Sbjct: 68 GK 69

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2946 

A DNA sequence (GASx2103) was identified in S.pyogenes <SEQ ID 8391> which encodes the amino acid
sequence <SEQ ID 8392>. Analysis of this protein sequence reveals the following: 

Possible site: 39 

>>> Seems to have no N-terminal signal sequence 

Final Results

bacterial cytoplasm --- Certainty=0.3316 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2947

A DNA sequence (GASx2104) was identified in S.pyogenes <SEQ ID 8393> which encodes the amino acid
sequence <SEQ ID 8394>. Analysis of this protein sequence reveals the following:

Possible site: 55

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4371 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2948 

A DNA sequence (GASx2105) was identified in S.pyogenes <SEQ ID 8395> which encodes the amino acid 
sequence <SEQ ID 8396>. Analysis of this protein sequence reveals the following: 

Possible site: 40 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2263 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.



WO 02/34771 



-2884- 



PCT/GB01/04789 



The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2949 

A DNA sequence (GASx2106) was identified in S.pyogenes <SEQ ID 8397> which encodes the amino acid
sequence <SEQ ID 8398>. Analysis of this protein sequence reveals the following: 

Possible site: 32

>>> Seems to have an uncleavable N-term signal seq

INTEGRAL Likelihood = -6.42 Transmembrane 9 - 25 ( 6 - 29)

Final Results

bacterial membrane --- Certainty=0.3569 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.
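
The "INTEGRAL ... Transmembrane" lines that accompany membrane predictions carry a likelihood score and two residue ranges. As a hedged illustration, the sketch below extracts those fields; the `Segment` type and `parse_segments` function are hypothetical names, and the parenthesised pair is simply kept as a second range without interpreting it.

```python
import re
from typing import NamedTuple


class Segment(NamedTuple):
    likelihood: float
    start: int        # first residue of the reported segment
    end: int          # last residue of the reported segment
    outer_start: int  # first number of the parenthesised pair
    outer_end: int    # second number of the parenthesised pair


# e.g. "INTEGRAL Likelihood = -6.42 Transmembrane 9 - 25 ( 6 - 29)"
INTEGRAL_RE = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?\d+\.\d+)\s+Transmembrane\s+"
    r"(\d+)\s*-\s*(\d+)\s*\(\s*(\d+)\s*-\s*(\d+)\s*\)"
)


def parse_segments(text):
    """Return a list of Segment tuples, in the order they appear in text."""
    return [
        Segment(float(m.group(1)), *(int(g) for g in m.groups()[1:]))
        for m in INTEGRAL_RE.finditer(text)
    ]


if __name__ == "__main__":
    sample = """
    INTEGRAL Likelihood = -6.42 Transmembrane 9 - 25 ( 6-29)
    INTEGRAL Likelihood = -3.40 Transmembrane 26 - 42 ( 23 - 44)
    """
    for seg in parse_segments(sample):
        # Segment length counts both end residues.
        print(seg, "length:", seg.end - seg.start + 1)
```

Lines in which a number is missing are skipped by the pattern rather than guessed at, which keeps incomplete entries out of any downstream tally.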

Example 2950 

A DNA sequence (GASx2107) was identified in S.pyogenes <SEQ ID 8399> which encodes the amino acid 
sequence <SEQ ID 8400>. Analysis of this protein sequence reveals the following: 

Possible site: 25 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.1355 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2951 

A DNA sequence (GASx2108) was identified in S.pyogenes <SEQ ID 8401> which encodes the amino acid
sequence <SEQ ID 8402>. Analysis of this protein sequence reveals the following:

Possible site: 26

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3050 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2952 

A DNA sequence (GASx2109) was identified in S.pyogenes <SEQ ID 8403> which encodes the amino acid 
sequence <SEQ ID 8404>. Analysis of this protein sequence reveals the following: 
Possible site: 13

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3628 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAB46557 GB:AJ242479 putative replication protein [Streptococcus thermophilus] 
Identities = 143/242 (59%), Positives = 180/242 (74%), Gaps = 2/242 (0%)



Query: 59 EKNGSYKRNNASLVYRDLIFLDYDEIETGVNLPKIVSQTLWEYSYIIYPTIKHTPEKPRY 118 

+++G+YKRNNASL+YRDLI FLDYD++E + P+ VS L YSY+IYPTIKHT EKPRY 
Sbjct: 61 DEDGNYKRNNASLLYRDLIFLDYDKLEASTDFPRAVSNALNGYSYVIYPTIKHTAEKPRY 120 

Query: 119 RLVWKPSDVMTFATYKQWKEIADKIGLPFDLASLTWSQLQGLPVTTGDPEDYQRYVNHG 178 

RDV+KP+D M E TYK +EIADKIGLPFD +SLTWSQLQGLPVTTGDPE Y+R VN G 
Sbjct: 121 RLWKPTDKTC1EQTYKATAQEIADKIGLPFDDSSLTWSQLQGLPVTTGDPEKYERIVNRG 180 

Query: 179 rjDYPVPKNGSTPNRQWTTYTPRPRSQRSITMRVIDTLFNGFGNEGGRNVALTKFVGLLF 238 

YPV + +TPR +S+TMRV+DTL NGFG+EGGRN+ +T+FVGLL 

Sbjct: 181 RCYPVANPNTWANHSPNYHTPRQSGDKSLTmVVDTLMGFGDEGGRNIEVTRFVGLLL 240 

Query: 239 NK 240 
+K 

Sbjct: 241 SK 242 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2953 

A DNA sequence (GASx2110) was identified in S.pyogenes <SEQ ID 8405> which encodes the amino acid
sequence <SEQ ID 8406>. Analysis of this protein sequence reveals the following: 

Possible site: 28

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.5215 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 











































Query: 304
 E AK++G++A + A PKTPL A LK Y+RFIR++P
Sbjct: 307

Query: 364
 +F TSIL+GK++VIGDDIQKD VIKDTS +FSIATGD + IEDKGKRPY+ R MTWQS
Sbjct: 367

Query: 424
 ; PRMN D+ AI+RRFR+L F+++ KGK +K I+NDY+ RKEVLEY +KLAIETP D
Sbjct: 426

Query: 482
 +NP+ SIE L+E +KEMNPV DFV +FF DE + ++PN +V+ +K +
Sbjct: 486

Query: 542
 V R LHK+IK LP+ F
Sbjct: 546



Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2954 

A DNA sequence (GASx2111) was identified in S.pyogenes <SEQ ID 8407> which encodes the amino acid
sequence <SEQ ID 8408>. Analysis of this protein sequence reveals the following: 

Possible site: 41

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.0994 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2955 

A DNA sequence (GASx2112) was identified in S.pyogenes <SEQ ID 8409> which encodes the amino acid 
sequence <SEQ ID 8410>. Analysis of this protein sequence reveals the following:

Possible site: 54 

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3058 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2956 

A DNA sequence (GASx2114) was identified in S.pyogenes <SEQ ID 8411> which encodes the amino acid
sequence <SEQ ID 8412>. Analysis of this protein sequence reveals the following: 

Possible site: 37

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2815 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2957 

A DNA sequence (GASx2115R) was identified in S.pyogenes <SEQ ID 8413> which encodes the amino 
acid sequence <SEQ ID 8414>. Analysis of this protein sequence reveals the following:

Possible site: 27

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2958 

A DNA sequence (GASx2116) was identified in S.pyogenes <SEQ ID 8415> which encodes the amino acid
sequence <SEQ ID 8416>. Analysis of this protein sequence reveals the following:

Possible site: 56 

>>> Seems to have no N-terminal signal sequence 

Final Results

bacterial cytoplasm --- Certainty=0.4213 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2959 

A DNA sequence (GASx2117) was identified in S.pyogenes <SEQ ID 8417> which encodes the amino acid
sequence <SEQ ID 8418>. Analysis of this protein sequence reveals the following:

Possible site: 20

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3091 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2960

A DNA sequence (GASx2118) was identified in S.pyogenes <SEQ ID 8419> which encodes the amino acid
sequence <SEQ ID 8420>. Analysis of this protein sequence reveals the following: 

Possible site: 41

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2961

A DNA sequence (GASx2119) was identified in S.pyogenes <SEQ ID 8421> which encodes the amino acid
sequence <SEQ ID 8422>. Analysis of this protein sequence reveals the following: 

Possible site: 22

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.2531 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAF63071 GB:AF153600 gp137 [Streptococcus thermophilus bacteriophage Sfi11]
Identities = 41/121 (33%), Positives = 65/121 (52%), Gaps = 3/121 (2%)

Query: 4 KNAIRKLKEFHRWQRIAN- SLDLTYTELYQFDIEYHPTRR- - KHLEISRECALEELDAIR 60
 K RKL+E+ RW+ IA+ S + T4 + F + +++ + R AL EL+AI
Sbjct: 13 KRCKRKLREYPRWREIAHDSAEQKITQEFTFMPRGGGVNKPVENIAVRRVDALNELEAIE 72

Query: 61 YAINQLSKVEYRQ1LIECYLISEEKTQQDIMEELNGSQSWYYESKKRALLEFVEFYRDGAL 121
 A+N L + +YR+ILIE YL K I + + ++ + E ++L F E YRDG L
Sbjct: 73 QAVNGLYRPDYRR1LIEKYLAYPPKPNWQIAQSIGFERTAFQELLNNSILAFAELYRDGRL 133

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2962 

A DNA sequence (GASx2120) was identified in S.pyogenes <SEQ ID 8423> which encodes the amino acid 
sequence <SEQ ID 8424>. Analysis of this protein sequence reveals the following:

Possible site: 24 

>>> Seems to have no N-terminal signal sequence 

Final Results

bacterial cytoplasm --- Certainty=0.2666 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2963 

A DNA sequence (GASx2121) was identified in S.pyogenes <SEQ ID 8425> which encodes the amino acid 
sequence <SEQ ID 8426>. Analysis of this protein sequence reveals the following:

Possible site: 30

>>> Seems to have a cleavable N-term signal seq.

Final Results

bacterial outside --- Certainty=0.3000 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2964 

A DNA sequence (GASx2123R) was identified in S.pyogenes <SEQ ID 8427> which encodes the amino
acid sequence <SEQ ID 8428>. Analysis of this protein sequence reveals the following: 
Possible site: 21

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.3441 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2965

A DNA sequence (GASx2132) was identified in S.pyogenes <SEQ ID 8429> which encodes the amino acid 
sequence <SEQ ID 8430>. Analysis of this protein sequence reveals the following: 

Possible site: 31

>>> Seems to have an uncleavable N-term signal seq

Final Results

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 






The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2966 

A DNA sequence (GASx2136) was identified in S.pyogenes <SEQ ID 8431> which encodes the amino acid
sequence <SEQ ID 8432>. Analysis of this protein sequence reveals the following: 

Possible site: 30

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood = -3.19 Transmembrane 57 - 73 ( 54 - 73)

Final Results

bacterial membrane --- Certainty=0.2275 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database:

>GP:AAB18271 GB:U74623 CadX [Staphylococcus lugdunensis]
Identities = 50/110 (45%), Positives = 76/110 (68%)

Query: 11 MKKDSICQVGVINQQNVTTATNYLEKEKVQKSLRILSKFTDNKQINIIFYLLAVEELCVC 70
 M ++ C V +++ V A ++LE +K +K L IL K D K++ II L+ +ELCVC
Sbjct: 1 MSYENACDVICVHEDKWINALSFLEDDKSKKLLNILEKICDEKKLKIILSLIKEDELCVC 60

Query: 71 DIACLLNLSMASASHHLRKLANQNILDTRREGKIIYYFIKDEEIRDFFNQ 120
 DI+ +L +S+AS SHHLR L ++LD ++GK+ YYFIKD+EIR+FF++
Sbjct: 61 DISLILKMSVASTSHHLRLLYKNDVLDFYKKGKMAYYFIKDDEIREFFSK 110

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful
antigens for vaccines or diagnostics. 

Example 2967 

A DNA sequence (GASx2137) was identified in S.pyogenes <SEQ ID 8433> which encodes the amino acid 
sequence <SEQ ID 8434>. Analysis of this protein sequence reveals the following: 

Possible site: 49

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4582 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics.

Example 2968

A DNA sequence (GASx2139) was identified in S.pyogenes <SEQ ID 8435> which encodes the amino acid 
sequence <SEQ ID 8436>. Analysis of this protein sequence reveals the following: 

Possible site: 28 

>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood = -5.89 Transmembrane 63 - 79 ( 54 - 80)

Final Results

bacterial membrane --- Certainty=0.3357 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2969 

A DNA sequence (GASx2141R) was identified in S.pyogenes <SEQ ID 8437> which encodes the amino 
acid sequence <SEQ ID 8438>. Analysis of this protein sequence reveals the following:

Possible site: 19

>>> Seems to have no N-terminal signal sequence

Final Results

bacterial cytoplasm --- Certainty=0.4663 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2970 

A DNA sequence (GASx2142) was identified in S.pyogenes <SEQ ID 8439> which encodes the amino acid
sequence <SEQ ID 8440>. Analysis of this protein sequence reveals the following: 

Possible site: 29 





>>> Seems to have a cleavable N-term signal seq.

INTEGRAL Likelihood =-10.08 Transmembrane 143 - 159 ( 135 - 165)
INTEGRAL Likelihood = -7.64 Transmembrane 53 - 69 ( 49 - 79)
INTEGRAL Likelihood = -7.17 Transmembrane 252 - 268 ( 248 - 275)
INTEGRAL Likelihood = -6.74 Transmembrane 186 - 202 ( 183 - 208)
INTEGRAL Likelihood = -5.63 Transmembrane 220 - 236 ( 218 - 240)
INTEGRAL Likelihood = -5.26 Transmembrane     - 132 ( 115 - 136)
INTEGRAL Likelihood = -2.02 Transmembrane 85 - 101 ( 85 -    )
INTEGRAL Likelihood = -0.64 Transmembrane 165 - 181 ( 165 - 181)



Final Results 

bacterial membrane --- Certainty=0.5034 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>



No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 9 FKGMIIALGFILPGVSGGVLflAILGIYERKISFLAHMRDNFIENVLFFLPVGIG- - -GIL 55
 F G+++ + ++PGVSGG +A ++G+YE++I + ++ +PVG G G+
Sbjct: 7 FSGVLMGIJyWVPGVSGGTIAVLMGVYEKLIESWSFFHGNSRSLKVLIPVGAGVLVGVF 66

Query: 66 GIALFSFPVEFLLKHYQVSVLWGFAGAI VGT IPSLI XESTKQSQRDKADWLWLVLTFVIS 125
 GIA F +E L Y V + F G I I S +K TK+ K + + FV+
Sbjct: 67 GIARF- - -LEIFLSKYPVPTHFFFLGLI- - - IVSFVK- -TKEYFSIKP VNIFFVLL 114

Query: 126 GLGLYFLNDLIG--TLPANFLTFIIAGALIALGVLVPGLSPSNLLLILGLYGPMLIGFKS 183
 G+ L F+ G T + +L G + A ++VPG+S S +LLI G+Y +L
Sbjct: 115 GMFLIFMLHFSGETTAKESMFLLVLGGFVARTAMWPGISGSLILLIFGVYDHVLYLVSH 174

Query: 184 LDLLGTFLP1AIGGVIAILAFSKSMDYALQHHHSKVYHFIIGIVLSSTLLILIPNSSSPE 243
 L ++G L +IG V IL K M++ L+ + Y FI G++L+S h ++P +
Sbjct: 175 L-IIGELLIFSIGWAGILVSVKIMNFLLKRFREETYSFIGGMILAS-LYEVLPKKMTTTN 232

Query: 244 SISYSHAGILTWLMA.FVLFALGIWLGLWMSQLEEK 278
 + L + + L + LG ++ +E+K
Sbjct: 233







Based on this analysis, it was predicted that this GAS-specific protein and its epitopes, could be useful 
antigens for vaccines or diagnostics. 

Example 2971

A DNA sequence (GASx2143R) was identified in S.pyogenes <SEQ ID 8441> which encodes the amino 
acid sequence <SEQ ID 8442>. Analysis of this protein sequence reveals the following: 

Possible site: 20 



Final Results 

bacterial cytoplasm --- Certainty=0.3564 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:BAB05000 GB:AP001511 unknown conserved protein in others 
[Bacillus halodurans] 
Identities = 28/78 (35%) , Positives = 37/78 (46%) 

Query: 44 EVDK^FIVPLRQLLFTDPVYYRLEVTPIETTDFPFDRIRNGKYYQFSQEYRSIPFYENLE 103 

EVD VF VP+ + p YR+ V FP +RI N YQ S + FY 

Sbjct: 127 EVDHVFTVPIDHFISHPPEQYRINVHFEPGAGFPIERIANQSAYQKSTRQITESFYYYQS 186 



Query: 104 ETIWGMTAQFTKCLTDIL 121 

IWG+TA+ + + IL 
Sbjct: 187 YVIWGLTAKILRHVITIL 204 




Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics. 

Example 2972

A DNA sequence (GASx2144R) was identified in S.pyogenes <SEQ ID 8443> which encodes the amino 
acid sequence <SEQ ID 8444>. Analysis of this protein sequence reveals the following: 

Possible site: 17 


>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.4761 (Affirmative) < succ>
bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics. 

Example 2973 

A DNA sequence (GASx2145) was identified in S.pyogenes <SEQ ID 8445> which encodes the amino acid 
sequence <SEQ ID 8446>. Analysis of this protein sequence reveals the following: 

Possible site: 25

>>> Seems to have an uncleavable N-term signal seq 

INTEGRAL Likelihood = -4.09 Transmembrane 2 - 18 ( 1-19) 

Final Results

bacterial membrane --- Certainty=0.2635 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae.

The protein has homology with the following sequences in the GENPEPT database: 

>GP:CAA49519 GB:X69895 X [Bacillus sphaericus] 
Identities = 40/97 (41%) , Positives = 57/97 (58%) , Gaps = 5/97 (5%) 

Query: 10 IEFLILAIVEKNDSYGYDISQTIKLVAN----IKESTLYPILKKLEKAGFLTTYSQE-HQ 64
++ +IL ++ + D YGY+ISQ I N IKE+TLY + ++LEK + Y +
Sbjct: 11 LDSIILRLILEKDRYGYEISQEISNRTNNSFQIKFATLYAVFQRLEKKEVIFAYYGDVSD 70

Query: 65 GRKRKYYAVTSSGRAQLIFLKKEWQSYKFALDGIIEG 101
G KRKYY +TS G+A L L KEW K +D +EG
Sbjct: 71 GGKRKYYRITSLGKAYLSELVKEWAEVKEIIDLFMEG 107
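
The summary line of the alignment above ("Identities = 40/97 (41%)" and so on) can be read mechanically and its percentages recomputed from the counts. This is a minimal sketch under stated assumptions: the names parse_alignment_header and truncated_percent are hypothetical, and the percentage is assumed to be the ratio rounded down to a whole number, which is consistent with the three figures on that line.

```python
import re

# One "Name = count/total (pct%)" field of the alignment summary line.
FIELD = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)\s*/\s*(\d+)\s*\(\s*(\d+)\s*%\s*\)"
)


def parse_alignment_header(line: str) -> dict[str, tuple[int, int, int]]:
    """Map each field name to (count, total, reported percent)."""
    return {
        name: (int(count), int(total), int(pct))
        for name, count, total, pct in FIELD.findall(line)
    }


def truncated_percent(count: int, total: int) -> int:
    """Integer percentage rounded down (an assumption, see lead-in)."""
    return count * 100 // total


line = "Identities = 40/97 (41%) , Positives = 57/97 (58%) , Gaps = 5/97 (5%)"
parsed = parse_alignment_header(line)
for name, (count, total, reported) in parsed.items():
    print(name, f"{count}/{total}", reported, truncated_percent(count, total))
```

A mismatch between the reported and recomputed percentage would point to a transcription error in either the count or the percentage.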

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics. 

Example 2974

A DNA sequence (GASx2146) was identified in S.pyogenes <SEQ ID 8447> which encodes the amino acid 
sequence <SEQ ID 8448>. Analysis of this protein sequence reveals the following: 

Possible site: 56 

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood =-14.75 Transmembrane 97 - 113 ( 77 - 143) 
INTEGRAL Likelihood = -6.85 Transmembrane 116 - 132 ( 114 - 143)
INTEGRAL Likelihood = -5.68 Transmembrane 156 - 172 ( 149 - 175)
INTEGRAL Likelihood = -5.47 Transmembrane 79 - 95 ( 77 - 96)
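
The INTEGRAL lines above have a regular layout, so the predicted transmembrane segments can be collected and ordered along the sequence. The following is a minimal sketch; the Segment type and parse_segments function are hypothetical names, and the parenthesised range is stored as reported without interpreting it, since the text does not define it.

```python
import re
from typing import NamedTuple


class Segment(NamedTuple):
    likelihood: float
    start: int
    end: int
    outer_start: int  # first number of the parenthesised range, as reported
    outer_end: int    # second number of the parenthesised range, as reported


# Tolerates "=-14.75" as well as "= -6.85", and a "{" misread for "(".
INTEGRAL = re.compile(
    r"INTEGRAL\s+Likelihood\s*=\s*(-?\d+\.\d+)\s+Transmembrane\s+"
    r"(\d+)\s*-\s*(\d+)\s*[({]\s*(\d+)\s*-\s*(\d+)\s*\)"
)


def parse_segments(text: str) -> list[Segment]:
    """Return every INTEGRAL entry found in the text."""
    return [
        Segment(float(m[1]), int(m[2]), int(m[3]), int(m[4]), int(m[5]))
        for m in INTEGRAL.finditer(text)
    ]


report = """\
INTEGRAL Likelihood =-14.75 Transmembrane 97 - 113 ( 77 - 143)
INTEGRAL Likelihood = -6.85 Transmembrane 116 - 132 ( 114 - 143)
INTEGRAL Likelihood = -5.68 Transmembrane 156 - 172 ( 149 - 175)
INTEGRAL Likelihood = -5.47 Transmembrane 79 - 95 ( 77 - 96)
"""
segments = sorted(parse_segments(report), key=lambda s: s.start)
for seg in segments:
    print(seg.start, seg.end, seg.likelihood)
```

Sorting by start position lists the four segments of this example in sequence order rather than in order of likelihood.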

Final Results 

bacterial membrane --- Certainty=0.6901 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>
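
The "Final Results" block above assigns a certainty to each of three locations, and the predicted location is the one with the highest value. The following is a minimal sketch of reading such a block; parse_final_results is a hypothetical name, and the pattern deliberately tolerates stray spaces or a comma around the decimal point, as can occur in scanned copies.

```python
import re

# location ... Certainty=<number> (<label>); separators between the
# location and "Certainty" are skipped, whatever they are.
RESULT = re.compile(
    r"bacterial\s+(cytoplasm|membrane|outside)\b.*?"
    r"[Cc]ertainty\s*[=-]\s*([0-9][0-9 .,]*)\(\s*(Affirmative|Not Clear)\s*\)"
)


def parse_final_results(text: str) -> dict[str, tuple[float, str]]:
    """Map each location to (certainty, label)."""
    results = {}
    for location, raw, label in RESULT.findall(text):
        value = float(raw.replace(",", ".").replace(" ", ""))
        results[location] = (value, label)
    return results


report = """\
bacterial membrane --- Certainty=0.6901 (Affirmative) < succ>
bacterial outside --- Certainty=0.0000 (Not Clear) < succ>
bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>
"""
results = parse_final_results(report)
best = max(results, key=lambda loc: results[loc][0])
print(best, results[best])
```

Lines whose certainty contains unreadable characters simply fail to match and are skipped, so a missing location in the result signals a line that needs manual checking.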

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics. 

Example 2975 

A DNA sequence (GASx2147) was identified in S.pyogenes <SEQ ID 8449> which encodes the amino acid 
sequence <SEQ ID 8450>. Analysis of this protein sequence reveals the following:
Possible site: 31 

>>> Seems to have an uncleavable N-term signal seq

24 ( 6 - 30) 

Final Results 

bacterial membrane --- Certainty=0.3845 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 

>GP:AAF04457 GB:AF078161 lacunin [Manduca sexta] 
Identities = 68/310 (21%), Positives = 117/310 (36%), Gaps = 12/310 (3%) 

Query: 55 DIDSSASTITVETGPVQRPTVTYYTHPKLIDPIVTTVTGKTLSLSQTPKDWITGGIEIL 114
DI+ + ++ + E+ T++ T + TTTT+ST+I +
Sbjct: 1004 DIEGTTASGSTESTFTDETTMSKVTEESSVAEEETTKTTITEEVSGTSESASINSDKTTM 1063

Query: 115 GFTLNNSRQEKNYRSIT--1TVPEKTSLNEVKASNVPHTTLSNLT--VQDMQFDGNLTLL 170
++ + IT +TV E+TS TT+S ++ + T
Sbjct: 1064

E T S T + ++ S

Query: 171
Sbjct: 1124

Query: 230
Sbjct: 1184

Query: 289
Sbjct: 1238

Query: 349
Sbjct: 1298


Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics. 

Example 2976

A DNA sequence (GASx2148R) was identified in S.pyogenes <SEQ ID 8451> which encodes the amino
acid sequence <SEQ ID 8452>. Analysis of this protein sequence reveals the following: 

Possible site: 28 


>>> Seems to have an uncleavable N-term signal seq

Final Results 

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics. 

Example 2977 

A DNA sequence (GASx2160) was identified in S.pyogenes <SEQ ID 8453> which encodes the amino acid 

sequence <SEQ ID 8454>. Analysis of this protein sequence reveals the following: 

Possible site: 29

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.1630 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database.

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics. 

Example 2978 

A DNA sequence (GASx2170R) was identified in S.pyogenes <SEQ ID 8455> which encodes the amino 
acid sequence <SEQ ID 8456>. Analysis of this protein sequence reveals the following:

Possible site: 37

>>> Seems to have no N-terminal signal sequence

INTEGRAL Likelihood =-13.32 Transmembrane 181 - 197 ( 175 - 203) 


Final Results 

bacterial membrane --- Certainty=0.6328 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>


No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics. 

Example 2979

A DNA sequence (GASx2174) was identified in S.pyogenes <SEQ ID 8457> which encodes the amino acid
sequence <SEQ ID 8458>. Analysis of this protein sequence reveals the following: 

Possible site: 28 

>>> Seems to have an uncleavable N-term signal seq

19 ( 3 - 19) 

Final Results 

bacterial membrane --- Certainty=0.1956 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics. 

Example 2980 

A DNA sequence (GASx2181R) was identified in S.pyogenes <SEQ ID 8459> which encodes the amino 
acid sequence <SEQ ID 8460>. Analysis of this protein sequence reveals the following: 

Possible site: 24 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.3751 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics. 

Example 2981 

A DNA sequence (GASx2185R) was identified in S.pyogenes <SEQ ID 8461> which encodes the amino 
acid sequence <SEQ ID 8462>. Analysis of this protein sequence reveals the following: 

Possible site: 26 



>>> Seems to have no N-terminal signal sequence



INTEGRAL Likelihood = -0.90 Transmembrane 18 - 34 ( 18 - 34) 

Final Results 

bacterial membrane --- Certainty=0.1362 (Affirmative) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

bacterial cytoplasm --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has no significant homology with any sequences in the GENPEPT database. 

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics. 

Example 2982 

A DNA sequence (GASx2186R) was identified in S.pyogenes <SEQ ID 8463> which encodes the amino
acid sequence <SEQ ID 8464>. Analysis of this protein sequence reveals the following: 

Possible site: 51

>>> Seems to have no N-terminal signal sequence 

Final Results 

bacterial cytoplasm --- Certainty=0.4803 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 VSMKPIDLSK^SIRKRSKKWIKTNKKTLGKSIEERPSYINDRSEFGHWEIDLALGKKTK 60 

+ +K IDL + V IRK+ K T KK LGKSIEERPE IN+RS FG WEID LG KT 
Sbjct: 150 LEIKVIDLPRAVRIRKKFTKRPST-KKHLGKSIEERPEEINNRSRFGDWEIDSVLGGKTI 208 

Query: 61 SEAVMLTLVERQTRYAL 77 

E +LTLVERQTRYA+ 
Sbjct: 209 GEPSILTLVERQTRYAV 225

Based on this analysis, it was predicted that this GAS-specific protein and its epitopes could be useful
antigens for vaccines or diagnostics. 

Example 2983 

A DNA sequence (GASx2187R) was identified in S.pyogenes <SEQ ID 8465> which encodes the amino 
acid sequence <SEQ ID 8466>. Analysis of this protein sequence reveals the following: 

Possible site: 50 

>>> Seems to have no N-terminal signal sequence

Final Results 

bacterial cytoplasm --- Certainty=0.3287 (Affirmative) < succ>

bacterial membrane --- Certainty=0.0000 (Not Clear) < succ>

bacterial outside --- Certainty=0.0000 (Not Clear) < succ>

No corresponding DNA sequence was identified in S.agalactiae. 

The protein has homology with the following sequences in the GENPEPT database: 



Query: 1 MNMSNINSTRKSSYSHLSATERGEIAAYLKKGKKPVEIARLLGSHRSTICREIKRGSVDQ 60 

MNMS ST SY HLS ERGEI AYL +G KP EIAR LG +RSTI REI RGS+ Q 
Sbjct: 1 MNMSTNYSTTNQSYKHLSEAERGEIEAYLSVGLKPAEIARRLGRNRSTITREINRGSITQ 60 

Query: 61 VKDKNGKQTFFNAYFADSRQRVYETNR 87 

VK NG++ ++ Y+AD+ Y R 
Sbjct: 61 VKKVNGQKVYYQHYYADAAHNRYRHAR 87 



